Field of the Invention

This invention relates to proteins having
serine/threonine kinase domains, corresponding nucleic acid
molecules, and their use.
Background of the Invention

The transforming growth factor-β (TGF-β) superfamily
consists of a family of structurally-related proteins,
including three different mammalian isoforms of TGF-β
(TGF-β1, β2 and β3), activins, inhibins, müllerian-inhibiting
substance and bone morphogenic proteins (BMPs) (for reviews
see Roberts and Sporn (1990) Peptide Growth Factors and
Their Receptors, Pt. 1, Sporn and Roberts, eds. (Berlin:
Springer-Verlag) pp 419-472; Moses et al (1990) Cell 63,
245-247). The proteins of the TGF-β superfamily have a
wide variety of biological activities. TGF-β acts as a
growth inhibitor for many cell types and appears to play a
central role in the regulation of embryonic development,
tissue regeneration and immuno-regulation, as well as in
fibrosis and carcinogenesis (Roberts and Sporn (1990), see
above).

Activins and inhibins were originally identified as
factors which regulate the secretion of follicle-stimulating
hormone (Vale et al (1990) Peptide Growth Factors
and Their Receptors, Pt. 2, Sporn and Roberts, eds. (Berlin:
Springer-Verlag) pp. 211-248). Activins were also shown to
induce the differentiation of haematopoietic progenitor
cells (Murata et al (1988) Proc. Natl. Acad. Sci. USA 85,
2434-2438; Eto et al (1987) Biochem. Biophys. Res.
Commun. 142, 1095-1103) and to induce mesoderm formation in
Xenopus embryos (Smith et al (1990) Nature 345, 729-731;
van den Eijnden-Van Raaij et al (1990) Nature 345, 732-
734).

BMPs, or osteogenic proteins, induce the formation
of bone and cartilage when implanted subcutaneously (Wozney
et al (1988) Science 242, 1528-1534), facilitate neuronal
differentiation (Paralkar et al (1992) J. Cell Biol. 119,
1721-1728) and induce monocyte chemotaxis (Cunningham et al
(1992) Proc. Natl. Acad. Sci. USA 89, 11740-11744).
Müllerian-inhibiting substance induces regression of the
Müllerian duct in the male reproductive system (Cate et al
(1986) Cell 45, 685-698), and a glial cell line-derived
neurotrophic factor enhances survival of midbrain
dopaminergic neurons (Lin et al (1993) Science 260, 1130-
1132). The action of these growth factors is mediated
through binding to specific cell surface receptors.

Within this family, TGF-β receptors have been most
thoroughly characterized. By covalently cross-linking
radio-labelled TGF-β to cell surface molecules followed by
polyacrylamide gel electrophoresis of the affinity-labelled
complexes, three distinct size classes of cell surface
proteins (in most cases) have been identified, denoted
receptor type I (53 kD), type II (75 kD), type III or
betaglycan (a 300 kD proteoglycan with a 120 kD core
protein) (for a review see Massagué (1992) Cell 69, 1067-
1070) and more recently endoglin (a homodimer of two 95 kD
subunits) (Cheifetz et al (1992) J. Biol. Chem. 267, 19027-
19030). Current evidence suggests that type I and type II
receptors are directly involved in receptor signal
transduction (Segarini et al (1989) Mol. Endo., 3, 261-272;
Laiho et al (1991) J. Biol. Chem. 266, 9100-9112) and may
form a heteromeric complex; the type II receptor is needed
for the binding of TGF-β to the type I receptor and the
type I receptor is needed for the signal transduction
induced by the type II receptor (Wrana et al (1992) Cell,
71, 1003-1004). The type III receptor and endoglin may
have more indirect roles, possibly by facilitating the
binding of ligand to type II receptors (Wang et al (1991)
Cell, 67, 797-805; Lopez-Casillas et al (1993) Cell, 73,
1435-1444).

Binding analyses with activin A and BMP4 have led to
the identification of two co-existing cross-linked affinity
complexes of 50-60 kDa and 70-80 kDa on responsive cells
(Hino et al (1989) J. Biol. Chem. 264, 10309-10314;
Mathews and Vale (1991) Cell, 775-785; Paralkar et al
(1991) Proc. Natl. Acad. Sci. USA, 8913-8917). By
analogy with TGF-β receptors they are thought to be
signalling receptors and have been named type I and type II
receptors.

Among the type II receptors for the TGF-β superfamily
of proteins, the cDNA for the activin type II receptor (Act
RII) was the first to be cloned (Mathews and Vale (1991)
Cell 65, 973-982). The predicted structure of the receptor
was shown to be a transmembrane protein with an
intracellular serine/threonine kinase domain. The activin
receptor is related to the C. elegans daf-1 gene product,
whose ligand is currently unknown (Georgi et al (1990)
Cell 61, 635-645). Thereafter, another form of the activin
type II receptor (activin type IIB receptor), of which
there are different splicing variants (Mathews et al
(1992) Science 255, 1702-1705; Attisano et al (1992) Cell
68, 97-108), and the TGF-β type II receptor (TβRII) (Lin
et al (1992) Cell 68, 775-785) were cloned, both of which have
putative serine/threonine kinase domains.
Summary of the Invention 

The present invention involves the discovery of
related novel peptides, including peptides having the
activity of those defined herein as SEQ ID Nos. 2, 4, 8,
10, 12, 14, 16 and 18. Their discovery is based on the
realisation that receptor serine/threonine kinases form a
new receptor family, which may include the type II
receptors for other proteins in the TGF-β superfamily. To
ascertain whether there were other members of this family
of receptors, a protocol was designed to clone ActRII/daf-1
related cDNAs. This approach made use of the polymerase
chain reaction (PCR), using degenerate primers based upon
the amino-acid sequence similarity between kinase domains
of the mouse activin type II receptor and daf-1 gene
products.



This strategy resulted in the isolation of a new
family of receptor kinases called Activin receptor like
kinases (ALKs) 1-6. These cDNAs showed an overall 33-39%
sequence similarity with ActRII and TGF-β type II receptor
and 40-92% sequence similarity towards each other in the
kinase domains.

Soluble receptors according to the invention comprise
at least predominantly the extracellular domain. These can
be selected from the information provided herein, prepared
in conventional manner, and used in any manner associated
with the invention.

Antibodies to the peptides described herein may be
raised in conventional manner. By selecting unique
sequences of the peptides, antibodies having desired
specificity can be obtained.

The antibodies may be monoclonal, prepared in known
manner. In particular, monoclonal antibodies to the
extracellular domain are of potential value in therapy.

Products of the invention are useful in diagnostic
methods, e.g. to determine the presence in a sample of an
analyte binding therewith, such as in an antagonist assay.
Conventional techniques, e.g. an enzyme-linked
immunosorbent assay, may be used.

Products of the invention having a specific receptor
activity can be used in therapy, e.g. to modulate
conditions associated with activin or TGF-β activity. Such
conditions include fibrosis, e.g. liver cirrhosis and
pulmonary fibrosis, cancer, rheumatoid arthritis and
glomerulonephritis.
Brief Description of the Drawings

Figure 1 shows the alignment of the serine/threonine
(S/T) kinase domains (I-VIII) of related receptors from
transmembrane proteins, including embodiments of the
present invention. The nomenclature of the subdomains is
according to Hanks et al (1988).

Figures 2A to 2D show the sequences and
characteristics of the respective primers used in the
initial PCR reactions. The nucleic acid sequences are also
given as SEQ ID Nos. 19 to 22.

Figure 3 is a comparison of the amino-acid sequences
of human activin type II receptor (Act R-II), mouse activin
type ... (TβR ...) ... and mouse ALK-...

Figure 4 shows, schematically, the structures for Daf-1,
Act R-II, Act R-IIB, TβR-II, TβR-I/ALK-5, ALKs -1, -2
(Act RIA), -3, -4 (Act RIB) & -6.




Figure 5 ... rich domains of the ALKs ...

Figure 6 is a comparison of kinase domains of
serine/threonine kinases, showing the percentage amino-acid
identity of the kinase domains.

Figure 7 shows the pairwise alignment relationship
between the kinase domains of the receptor serine/threonine
kinases. The dendrogram was generated using the Jotun-Hein
alignment program (Hein (1990) Meth. Enzymol. 183, 626-
645).

Brief Description of Sequence Listings

Sequences 1 and 2 are the nucleotide and deduced
amino-acid sequences of cDNA for hALK-1 (clone HP57).

Sequences 3 and 4 are the nucleotide and deduced
amino-acid sequences of cDNA for hALK-2 (clone HP53).

Sequences 5 and 6 are the nucleotide and deduced
amino-acid sequences of cDNA for hALK-3 (clone ONF5).

Sequences 7 and 8 are the nucleotide and deduced amino-
acid sequences of cDNA for hALK-4 (clone 11H8),
complemented with PCR product encoding extracellular
domain.

Sequences 9 and 10 are the nucleotide and deduced
amino-acid sequences of cDNA for hALK-5 (clone EMBLA).

Sequences 11 and 12 are the nucleotide and deduced
amino-acid sequences of cDNA for mALK-1 (clone AM6).

Sequences 13 and 14 are the nucleotide and deduced
amino-acid sequences of cDNA for mALK-3 (clones ME-7 and
ME-D).

Sequences 15 and 16 are the nucleotide and deduced
amino-acid sequences of cDNA for mALK-4 (clone 8al).

Sequences 17 and 18 are the nucleotide and deduced
amino-acid sequences of cDNA for mALK-6 (clone ME-6).

Sequence 19 (B1-S) is a sense primer, extracellular
domain, cysteine-rich region, BamHI site at 5' end, 28-mer,
64-fold degeneracy.

Sequence 20 (B3-S) is a sense primer, kinase domain
II, BamHI site at 5' end, 25-mer, 162-fold degeneracy.

Sequence 21 (B7-S) is a sense primer, kinase domain
VIB, S/T kinase specific residues, BamHI site at 5' end,
24-mer, 288-fold degeneracy.

Sequence 22 (E8-AS) is an anti-sense primer, kinase
domain, S/T kinase-specific residues, EcoRI site at 5' end,
20-mer, 18-fold degeneracy.

Sequence 23 is an oligonucleotide probe. 

Sequence 24 is a 5' primer. 

Sequence 25 is a 3' primer. 

Sequence 26 is a consensus sequence in Subdomain I. 
Sequences 27 and 28 are novel sequence motifs in 
Subdomain VIB. 

Sequence 29 is a novel sequence motif in Subdomain
VIII.

Description of the Invention

As described in more detail below, nucleic acid 
sequences have been isolated, coding for a new sub-family 
of serine/threonine receptor kinases. The term nucleic 
acid molecules as used herein refers to any sequence which 
codes for the murine, human or mammalian form, amino-acid 
sequences of which are presented herein. It is understood 
that the well known phenomenon of codon degeneracy provides 
for a great deal of sequence variation and all such 
varieties are included within the scope of this invention. 



The nucleic acid sequences described herein may be
used to clone the respective genomic DNA sequences in order
to study the genes' structure and regulation. The murine
and human cDNA or genomic sequences can also be used to
isolate the homologous genes from other mammalian species.
The mammalian DNA sequences can be used to study the
receptors' functions in various in vitro and in vivo model
systems.

As exemplified below for ALK-5 cDNA, it is also
recognised that, given the sequence information provided
herein, the artisan could easily combine the molecules with
a pertinent promoter in a vector, so as to produce a
cloning vehicle for expression of the molecule. The
promoter and coding molecule must be operably linked via
any of the well-recognized and easily-practised
methodologies for so doing. The resulting vectors, as well
as the isolated nucleic acid molecules themselves, may be
used to transform prokaryotic cells (e.g. E. coli) or
transfect eukaryotes such as yeast (S. cerevisiae), PAE,
COS or CHO cell lines. Other appropriate expression
systems will also be apparent to the skilled artisan.

Several methods may be used to isolate the ligands for
the ALKs. As shown for ALK-5 cDNA, cDNA clones encoding
the active open reading frames can be subcloned into
expression vectors and transfected into eukaryotic cells,
for example COS cells. The transfected cells which can
express the receptor can be subjected to binding assays for
radioactively-labelled members of the TGF-β superfamily
(TGF-β, activins, inhibins, bone morphogenic proteins and
müllerian-inhibiting substances), as it may be expected
that the receptors will bind members of the TGF-β
superfamily. Various biochemical or cell-based assays can
be designed to identify the ligands, in tissue extracts or
conditioned media, for receptors in which a ligand is not
known. Antibodies raised to the receptors may also be used
to identify the ligands, using the immunoprecipitation of
the cross-linked complexes. Alternatively, purified
receptor could be used to isolate the ligands using an
affinity-based approach. The determination of the
expression patterns of the receptors may also aid in the
isolation of the ligand. These studies may be carried out
using ALK DNA or RNA sequences as probes to perform in situ
hybridisation studies.

The use of various model systems or structural studies
should enable the rational development of specific agonists
and antagonists useful in regulating receptor function. It
may be envisaged that these can be peptides, mutated
ligands, antibodies or other molecules able to interact
with the receptors.

The foregoing provides examples of the invention
Applicants intend to claim, which includes, inter alia,
isolated nucleic acid molecules coding for activin
receptor-like kinases (ALKs), as defined herein. These
include such sequences isolated from mammalian species such
as mouse, human, rat, rabbit and monkey.

The following description relates to specific
embodiments. It will be understood that the specification
and examples are illustrative but not limitative of the
present invention and that other embodiments within the
spirit and scope of the invention will suggest themselves
to those skilled in the art.

Preparation of mRNA and Construction of a cDNA Library

For construction of a cDNA library, poly(A)+ RNA was
isolated from a human erythroleukemia cell line (HEL
92.1.7) obtained from the American Type Culture Collection
(ATCC TIB 180). These cells were chosen as they have been
shown to respond to both activin and TGF-β. Moreover,
leukaemic cells have proved to be rich sources for the
cloning of novel receptor tyrosine kinases (Partanen et al
(1990) Proc. Natl. Acad. Sci. USA 87, 8913-8917 and (1992)
Mol. Cell. Biol. 12, 1698-1707). Total RNA was prepared
by the guanidinium isothiocyanate method (Chirgwin et al
(1979) Biochemistry 18, 5294-5299). mRNA was selected
using the poly-A or poly AT tract mRNA isolation kit
(Promega, Madison, Wisconsin, U.S.A.) as described by the
manufacturers, or purified through an oligo(dT)-cellulose
column as described by Aviv and Leder (1972) Proc. Natl.
Acad. Sci. USA 69, 1408-1412. The isolated mRNA was used
for the synthesis of random primed (Amersham) cDNA, that
was used to make a λgt10 library with 1x10^5 independent
cDNA clones using the Riboclone cDNA synthesis system
(Promega) and λgt10 in vitro packaging kit (Amersham)
according to the manufacturers' procedures. An amplified
oligo(dT) primed human placenta λZAPII cDNA library of
5x10^5 independent clones was used. Poly(A)+ RNA isolated
from AG1518 human foreskin fibroblasts was used to prepare
a primary random primed λZAPII cDNA library of 1.5x10*
independent clones using the Riboclone cDNA synthesis
system and Gigapack Gold II packaging extract (Stratagene).
In addition, a primary oligo(dT) primed human foreskin
fibroblast λgt10 cDNA library (Claesson-Welsh et al (1989)
Proc. Natl. Acad. Sci. USA 86, 4917-4921) was prepared. An
amplified oligo(dT) primed HEL cell λgt11 cDNA library of
1.5 x 10* independent clones (Poncz et al (1987) Blood
219-223) was used. A twelve-day mouse embryo λEXlox cDNA
library was obtained from Novagen (Madison, Wisconsin,
U.S.A.); a mouse placenta λZAPII cDNA library was also
used.

Generation of cDNA Probes by PCR

For the generation of cDNA probes by PCR (Lee et al
(1988) Science 239, 1288-1291) degenerate PCR primers were
constructed based upon the amino-acid sequence similarity
between the mouse activin type II receptor (Mathews and
Vale (1991) Cell 65, 973-982) and daf-1 (Georgi et al
(1990) Cell 61, 635-645) in the kinase domains II and VIII.
Figure 1 shows the aligned serine/threonine kinase domains
(I-VIII) of four related receptors of the TGF-β
superfamily, i.e. hTβR-II, mActR-IIB, mActR-II and the daf-
1 gene product, using the nomenclature of the subdomains
according to Hanks et al (1988) Science 241, 45-52.

Several considerations were applied in the design of
the PCR primers. The sequences were taken from regions of
homology between the activin type II receptor and the daf-1
gene product, with particular emphasis on residues that
confer serine/threonine specificity (see Table 2) and on
residues that are shared by transmembrane kinase proteins
and not by cytoplasmic kinases. The primers were designed
so that each primer of a PCR set had an approximately
similar GC composition, and so that self complementarity
and complementarity between the 3' ends of the primer sets
were avoided. Degeneracy of the primers was kept as low as
possible, in particular avoiding serine, leucine and
arginine residues (6 possible codons), and human codon
preference was applied. Degeneracy was particularly
avoided at the 3' end as, unlike the 5' end, where
mismatches are tolerated, mismatches at the 3' end
dramatically reduce the efficiency of PCR.
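
The degeneracy figures quoted for the primers follow from multiplying, position by position, the number of codons that can encode each amino acid. The following is a minimal sketch of that arithmetic, assuming the standard genetic code; the peptide strings are hypothetical illustrations and are not the actual primer sequences.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code
# (one-letter amino-acid codes; the 61 sense codons in total).
CODON_COUNTS = {
    "M": 1, "W": 1,
    "C": 2, "D": 2, "E": 2, "F": 2, "H": 2, "K": 2, "N": 2, "Q": 2, "Y": 2,
    "I": 3,
    "A": 4, "G": 4, "P": 4, "T": 4, "V": 4,
    "L": 6, "R": 6, "S": 6,
}


def primer_degeneracy(peptide: str) -> int:
    """Number of distinct nucleotide sequences that can encode `peptide`."""
    return prod(CODON_COUNTS[aa] for aa in peptide.upper())


# Hypothetical peptides, for illustration only.
print(primer_degeneracy("MWDEK"))  # 1*1*2*2*2 = 8
print(primer_degeneracy("MWSLR"))  # 1*1*6*6*6 = 216
```

The second peptide shows why serine, leucine and arginine are costly: three such residues alone contribute a 216-fold degeneracy.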

In order to facilitate directional subcloning,
restriction enzyme sites were included at the 5' end of the
primers, with a GC clamp, which permits efficient
restriction enzyme digestion. The primers utilised are
shown in Figure 2. Oligonucleotides were synthesized using
Gene Assembler Plus (Pharmacia-LKB) according to the
manufacturer's instructions.
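
The primer design rules described above (a restriction site with a short clamp at the 5' end, similar GC composition within a primer set, and no complementarity between 3' ends) can be sketched as simple checks. This is an illustrative sketch only: the function names, the two-base clamp and the three-base 3' window are assumptions, the core sequences are hypothetical, and degenerate bases are not handled. The BamHI and EcoRI recognition sequences are the standard ones.

```python
# Recognition sequences of the two enzymes named in the text.
RESTRICTION_SITES = {"BamHI": "GGATCC", "EcoRI": "GAATTC"}


def add_cloning_tail(core: str, enzyme: str, clamp: str = "GC") -> str:
    """Prefix a primer core with a clamp and a restriction site (5' to 3')."""
    return clamp + RESTRICTION_SITES[enzyme] + core.upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-degenerate sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def three_prime_complementary(primer_a: str, primer_b: str, n: int = 3) -> bool:
    """True if the last n bases of the two primers could base-pair."""
    complement = str.maketrans("ACGT", "TGCA")
    tail_a = primer_a.upper()[-n:]
    tail_b = primer_b.upper()[-n:]
    # Antiparallel pairing: tail_a must equal the reverse complement of tail_b.
    return tail_a == tail_b.translate(complement)[::-1]


# Hypothetical primer cores, for illustration only.
sense = add_cloning_tail("ACGTTGCAAGCA", "BamHI")
antisense = add_cloning_tail("TTGACCGTATGC", "EcoRI")
print(sense, round(gc_fraction(sense), 2))
print(antisense, round(gc_fraction(antisense), 2))
print(three_prime_complementary(sense, antisense))
```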

The mRNA prepared from HEL cells as described above
was reverse-transcribed into cDNA in the presence of 50 mM
Tris-HCl, pH 8.3, 8 mM MgCl2, 30 mM KCl, 10 mM
dithiothreitol, 2 mM nucleotide triphosphates, excess oligo
(dT) primers and 34 units of AMV reverse transcriptase at
42°C for 2 hours in 40 µl of reaction volume.
Amplification by PCR was carried out with a 7.5% aliquot (3
µl) of the reverse-transcribed mRNA, in the presence of 10
mM Tris-HCl, pH 8.3, 50 mM KCl, 1.5 mM MgCl2, 0.01% gelatin,
0.2 mM nucleotide triphosphates, 1 µM of both sense and
antisense primers and 2.5 units of Taq polymerase (Perkin
Elmer Cetus) in 100 µl reaction volume. Amplifications
were performed on a thermal cycler (Perkin Elmer Cetus)
using the following program: first 5 thermal cycles with
denaturation for 1 minute at 94°C, annealing for 1 minute
at 50°C, a 2 minute ramp to 55°C and elongation for 1 minute
at 72°C, followed by 20 cycles of 1 minute at 94°C, 30
seconds at 55°C and 1 minute at 72°C. A second round of PCR
was performed with 3 µl of the first reaction as a
template. This involved 25 thermal cycles, each composed
of 94°C (1 min), 55°C (0.5 min), 72°C (1 min).
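
The cycling program and the aliquot arithmetic above can be written down as data and checked. The sketch below is illustrative; the data layout and function names are assumptions, and the computed times cover only the programmed hold and ramp steps, not instrument transition times.

```python
# Each stage: (number of cycles, [(step name, temperature in deg C, seconds), ...])
FIRST_ROUND = [
    (5, [("denature", 94, 60), ("anneal", 50, 60),
         ("ramp to 55", 55, 120), ("extend", 72, 60)]),
    (20, [("denature", 94, 60), ("anneal", 55, 30), ("extend", 72, 60)]),
]
SECOND_ROUND = [
    (25, [("denature", 94, 60), ("anneal", 55, 30), ("extend", 72, 60)]),
]


def programmed_minutes(program) -> float:
    """Total programmed time in minutes, summed over all stages and cycles."""
    seconds = sum(cycles * sum(step[2] for step in steps)
                  for cycles, steps in program)
    return seconds / 60


def aliquot_volume(total_ul: float, percent: float) -> float:
    """Volume in microlitres corresponding to a percentage of a reaction."""
    return total_ul * percent / 100


print(programmed_minutes(FIRST_ROUND))   # 75.0
print(programmed_minutes(SECOND_ROUND))  # 62.5
print(aliquot_volume(40, 7.5))           # 3.0
```

The last line confirms that a 7.5% aliquot of the 40 µl reverse transcription reaction is the 3 µl stated in the text.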

General procedures such as purification of nucleic
acids, restriction enzyme digestion, gel electrophoresis,
transfer of nucleic acid to solid supports and subcloning
were performed essentially according to established
procedures as described by Sambrook et al (1989),
Molecular Cloning: A Laboratory Manual, 2nd Ed. Cold Spring
Harbor Laboratory (Cold Spring Harbor, New York, USA).

Samples of the PCR products were digested with BamHI
and EcoRI and subsequently fractionated by low melting
point agarose gel electrophoresis. Bands corresponding to
the approximate expected sizes (see Table 1: ≈460 bp for
primer pair B3-S and E8-AS and ≈140 bp for primer pair B7-
S and E8-AS) were excised from the gel and the DNA was
purified. Subsequently, these fragments were ligated into
pUC19 (Yanisch-Perron et al (1985) Gene 33, 103-119), which
had been previously linearised with BamHI and EcoRI, and
transformed into E. coli strain DH5α using standard
protocols (Sambrook et al, supra). Individual clones were
sequenced using standard double-stranded sequencing
techniques and the dideoxynucleotide chain termination
method as described by Sanger et al (1977) Proc. Natl.
Acad. Sci. USA 74, 5463-5467, and T7 DNA polymerase.

Employing reverse transcriptase PCR on HEL mRNA with
the primer pair B3-S and E8-AS, three PCR products were
obtained, termed 11.1, 11.2 and 11.3, that corresponded to
novel genes. Using the primer pair B7-S and E8-AS, an
additional novel PCR product was obtained, termed 5.2.



Table 1

NAME OF   PRIMERS      INSERT   SIZE OF DNA      SEQUENCE IDENTITY   SEQUENCE IDENTITY
PCR                    SIZE     FRAGMENT IN      WITH SEQUENCE       BETWEEN mActRII
PRODUCT                (bp)     mActRII/hTβRII   mActRII/hTβRII      AND TβR-II
                                GENES (bp)       (%)                 (%)

11.1      B3-S/E8-AS   460      460              46/40               42
11.2      B3-S/E8-AS   460      460              49/44               47
11.3      B3-S/E8-AS   460      460              44/96               46
11.29     B3-S/E8-AS   460      460              ND/100              ND
9.2       B1-S/E8-AS   600      795              100/ND              ND
5.2       B7-S/E8-AS   140      143              40/38               60



Isolation of cDNA Clones

The PCR products obtained were used to screen various
cDNA libraries described supra. Labelling of the inserts
of PCR products was performed using the random priming method
(Feinberg and Vogelstein (1983) Anal. Biochem. 132, 6-13)
using the Megaprime DNA labelling system (Amersham). The
oligonucleotide derived from the sequence of the PCR
product 5.2 was labelled by phosphorylation with T4
polynucleotide kinase following standard protocols
(Sambrook et al, supra). Hybridization and purification of
positive bacteriophages were performed using standard
molecular biological techniques.

The double-stranded DNA clones were all sequenced
using the dideoxynucleotide chain-termination method as
described by Sanger et al, supra, using T7 DNA polymerase
(Pharmacia-LKB) or Sequenase (U.S. Biochemical
Corporation, Cleveland, Ohio, U.S.A.). Compressions of
nucleotides were resolved using 7-deaza-GTP (U.S.
Biochemical Corp.). DNA sequences were analyzed using the
DNA STAR computer program (DNA STAR Ltd. U.K.). Analyses
of the sequences obtained revealed the existence of six
distinct putative receptor serine/threonine kinases which
have been named ALK 1-6.

To clone cDNA for ALK-1 the oligo(dT) primed human
placenta cDNA library was screened with a radiolabelled
insert derived from the PCR product 11.3; based upon their
restriction enzyme digestion patterns, three different
types of clones with approximate insert sizes of 1.7 kb,
2 kb & 3.5 kb were identified. The 2 kb clone, named
HP57, was chosen as representative of this class and
subjected to complete sequencing. Sequence analysis of
ALK-1 revealed a sequence of 1984 nucleotides including a
poly-A tail (SEQ ID No. 1). The longest open reading frame
encodes a protein of 503 amino-acids, with high sequence
similarity to receptor serine/threonine kinases (see
below). The first methionine codon, the putative
translation start site, is at nucleotides 283-285 and is
preceded by an in-frame stop codon. This first ATG is in
a more favourable context for translation initiation (Kozak
(1987) Nucl. Acids Res., 15, 8125-8148) than the second and
third in-frame ATG at nucleotides 316-318 and 325-327. The
putative initiation codon is preceded by a 5' untranslated
sequence of 282 nucleotides that is GC-rich (80% GC), which
is not uncommon for growth factor receptors (Kozak (1991)
J. Cell Biol., 115, 887-903). The 3' untranslated sequence
comprises 193 nucleotides and ends with a poly-A tail. No
bona fide poly-A addition signal is found, but there is a
sequence (AATACA), 17-22 nucleotides upstream of the poly-A
tail, which may serve as a poly-A addition signal.
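
Identifying "the longest open reading frame" in a cDNA, as done for each clone here, amounts to scanning the three forward reading frames for the longest ATG-to-stop stretch. The sketch below is a minimal illustration of that idea, not the procedure actually used (the text states that sequences were analysed with the DNA STAR program); the function name and the toy sequence are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf(seq: str):
    """Longest ATG-initiated ORF on the given strand.

    Returns (start, end, n_codons), where start is the 1-based position of
    the A of the ATG, end is the 1-based position of the last base before
    the stop codon, and n_codons counts sense codons (stop excluded).
    Returns None if no ORF terminated by a stop codon is found.
    """
    seq = seq.upper()
    best = None
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first ATG after the previous in-frame stop
            elif start is not None and codon in STOP_CODONS:
                n_codons = (i - start) // 3
                if best is None or n_codons > best[2]:
                    best = (start + 1, i, n_codons)
                start = None
    return best


# Toy sequence: 2-base leader, ATG, four GCT codons, TAA stop, 1 trailing base.
toy = "CC" + "ATG" + "GCT" * 4 + "TAA" + "G"
print(longest_orf(toy))  # (3, 17, 5)
```

Applied to a full-length cDNA, the start position returned corresponds to the kind of coordinate quoted in the text, such as nucleotides 283-285 for the first methionine codon of ALK-1.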

ALK-2 cDNA was cloned by screening an amplified oligo
(dT) primed human placenta cDNA library with a
radiolabelled insert derived from the PCR product 11.2.
Two clones, termed HP53 and HP64, with insert sizes of 2.7
kb and 2.4 kb respectively, were identified and their
sequences were determined. No sequence difference in the
overlapping clones was found, suggesting they are both
derived from transcripts of the same gene.
Sequence analysis of cDNA clone HP53 (SEQ ID No. 3)
revealed a sequence of 2719 nucleotides with a poly-A tail.
The longest open reading frame encodes a protein of 509
amino-acids. The first ATG at nucleotides 104-106 agrees
favourably with Kozak's consensus sequence with an A at
position -3. This ATG is preceded in-frame by a stop codon.
There are four ATG codons in close proximity further
downstream, which agree with Kozak's consensus sequence
(Kozak, supra), but according to Kozak's scanning model the
first ATG is predicted to be the translation start site.
The 5' untranslated sequence is 103 nucleotides. The 3'
untranslated sequence of 1089 nucleotides contains a
polyadenylation signal located 9-14 nucleotides upstream
from the poly-A tail. The cDNA clone HP64 lacks 498
nucleotides from the 5' end compared to HP53, but the
sequence is extended at the 3' end by 190 nucleotides and a
poly-A tail is absent. This suggests that different
polyadenylation sites occur for ALK-2. In Northern blots,
however, only one transcript was detected (see below).
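
The assessment of an ATG against Kozak's consensus can be approximated by checking the two most influential positions, a purine at -3 and a G at +4, counting the A of the ATG as +1. The following sketch is a simplification under that assumption; the three-way labels and the toy sequences are illustrative and hypothetical, and the full consensus considers more positions.

```python
def kozak_context(seq: str, atg_index: int) -> str:
    """Classify the context of an ATG by positions -3 and +4.

    `atg_index` is the 0-based index of the A of the ATG in `seq`.
    Returns "strong" (purine at -3 and G at +4), "adequate" (one of the
    two) or "weak" (neither).
    """
    seq = seq.upper()
    if seq[atg_index:atg_index + 3] != "ATG":
        raise ValueError("no ATG at the given index")
    minus3 = seq[atg_index - 3] if atg_index >= 3 else None
    plus4 = seq[atg_index + 3] if atg_index + 3 < len(seq) else None
    purine_at_minus3 = minus3 in ("A", "G")
    g_at_plus4 = plus4 == "G"
    if purine_at_minus3 and g_at_plus4:
        return "strong"
    if purine_at_minus3 or g_at_plus4:
        return "adequate"
    return "weak"


# Toy sequences, for illustration only.
print(kozak_context("GCCACCATGG", 6))  # strong
print(kozak_context("TTTATTATGT", 6))  # adequate
print(kozak_context("TTTTTTATGT", 6))  # weak
```
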
The cDNA for human ALK-3 was cloned by initially
screening an oligo(dT) primed human foreskin fibroblast
cDNA library with an oligonucleotide (SEQ ID No. 23)
derived from the PCR product 5.2. One positive cDNA clone
with an insert size of 3 kb, termed ON11, was identified.
However, upon partial sequencing, it appeared that this
clone was incomplete; it encodes only part of the kinase
domain and lacks the extracellular domain. The most 5'
sequence of ON11, a 540 nucleotide restriction
fragment encoding a truncated kinase domain, was
subsequently used to probe a random primed fibroblast cDNA
library from which one cDNA clone with an insert size of 3
kb, termed ONF5, was isolated (SEQ ID No. 5). Sequence
analysis of ONF5 revealed a sequence of 2932 nucleotides
without a poly-A tail, suggesting that this clone was
derived by internal priming. The longest open reading
frame codes for a protein of 532 amino-acids. The first
ATG codon which is compatible with Kozak's consensus
sequence (Kozak, supra) is at nucleotides 310-312 and is
preceded by an in-frame stop codon. The 5' and 3'
untranslated sequences are 309 and 1027 nucleotides long,
respectively.
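
The lengths reported for the ALK-1, ALK-2 and ALK-3 clones can be cross-checked arithmetically: the 5' untranslated length, three nucleotides per encoded amino acid, and the 3' untranslated length should add up to the total sequence length. The sketch below uses only the figures given in the text; it assumes, as the totals suggest, that the stop codon is counted within the 3' untranslated figure. The dictionary layout and function name are illustrative.

```python
# Figures as stated in the text for each human clone.
CLONES = {
    "ALK-1 (HP57)": {"utr5": 282, "amino_acids": 503, "utr3": 193, "total": 1984},
    "ALK-2 (HP53)": {"utr5": 103, "amino_acids": 509, "utr3": 1089, "total": 2719},
    "ALK-3 (ONF5)": {"utr5": 309, "amino_acids": 532, "utr3": 1027, "total": 2932},
}


def reconstructed_length(clone: dict) -> int:
    """5' UTR + coding nucleotides (3 per amino acid) + 3' UTR."""
    return clone["utr5"] + 3 * clone["amino_acids"] + clone["utr3"]


for name, clone in CLONES.items():
    start = clone["utr5"] + 1  # first nucleotide of the initiation codon
    print(name, reconstructed_length(clone), clone["total"], start)
```

For all three clones the reconstructed length equals the stated total, and the computed start positions (283, 104 and 310) match the ATG coordinates given in the text.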

ALK-4 cDNA was identified by screening an oligo
(dT) primed human erythroleukemia cDNA library with the
radiolabelled insert of the PCR product 11.1 as a probe.
One cDNA clone, termed 11H8, was identified with an insert
size of 2 kb (SEQ ID No. 7). An open reading frame was
found encoding a protein sequence of 383 amino-acids
encoding a truncated extracellular domain with high
similarity to receptor serine/threonine kinases. The 3'
untranslated sequence is 818 nucleotides and does not
contain a poly-A tail, suggesting that the cDNA was
internally primed. cDNA encoding the complete
extracellular domain (nucleotides 1-366) was obtained from
HEL cells by RT-PCR with 5' primer (SEQ ID No. 24) derived
in part from sequence at translation start site of SKR-2 (a
cDNA sequence deposited in GenBank data base, accession
number L10125, that is identical in part to ALK-4) and 3'
primer (SEQ ID No. 25) derived from 11H8 cDNA clone.

ALK-5 was identified by screening the random primed
HEL cell λgt10 cDNA library with the PCR product 11.1 as
a probe. This yielded one positive clone termed EMBLA
(insert size of 5.3 kb with 2 internal EcoRI sites).
Nucleotide sequencing revealed an open reading frame of
1509 bp, coding for 503 amino-acids. The open reading
frame was flanked by a 5' untranslated sequence of 76 bp,
and a 3' untranslated sequence of 3.7 kb which was not
completely sequenced. The nucleotide and deduced amino-
acid sequences of ALK-5 are shown in SEQ ID Nos. 9 and 10.
In the 5' part of the open reading frame, only one ATG
codon was found; this codon fulfils the rules of
translation initiation (Kozak, supra). An in-frame stop
codon was found at nucleotides (-54)-(-52) in the 5'
untranslated region. The predicted ATG start codon is
followed by a stretch of hydrophobic amino-acid residues
which has characteristics of a cleavable signal sequence.
Therefore, the first ATG codon is likely to be used as a
translation initiation site. A preferred cleavage site for
the signal peptidase, according to von Heijne (1986) Nucl.
Acids Res. 14, 4683-4690, is located between amino-acid
residues 24 and 25. The calculated molecular mass of the
primary translated product of ALK-5 without signal
sequence is 53,646 Da.
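
A calculated molecular mass of this kind is obtained by summing average residue masses over the mature sequence (after removal of the signal peptide) and adding one water molecule. The sketch below illustrates the calculation under the assumption that standard average residue masses are used; the function names are hypothetical, the toy sequence is not ALK-5, and no attempt is made here to reproduce the 53,646 Da figure.

```python
# Average residue masses in daltons (amino acid minus one water).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594,
    "N": 114.1038, "D": 115.0886, "Q": 128.1307, "K": 128.1741,
    "E": 129.1155, "M": 131.1926, "H": 137.1411, "F": 147.1766,
    "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the free N- and C-termini


def protein_mass(sequence: str) -> float:
    """Average molecular mass in Da of an unmodified polypeptide."""
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in sequence.upper()) + WATER


def mature_mass(precursor: str, cleavage_after: int) -> float:
    """Mass after removing a signal peptide of `cleavage_after` residues."""
    return protein_mass(precursor[cleavage_after:])


# Toy precursor with a hypothetical 3-residue "signal" segment.
toy = "MKLGASTV"
print(round(protein_mass(toy), 2))
print(round(mature_mass(toy, 3), 2))
```

For ALK-5, the text places the cleavage site between residues 24 and 25, which corresponds to `cleavage_after=24` in this sketch.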

Screening of the mouse embryo λEXlox cDNA library
using PCR product 11.1 as a probe yielded 20 positive
clones. DNAs from the positive clones obtained from this
library were digested with EcoRI and HindIII,
electrophoretically separated on a 1.3% agarose gel and
transferred to nitrocellulose filters according to
established procedures as described by Sambrook et al,
supra. The filters were then hybridized with specific
probes for human ALK-1 (nucleotides 288-670), ALK-2
(nucleotides 1-581), ALK-3 (nucleotides 79-824) or ALK-4
(nucleotides 1178-1967). Such analyses revealed that a clone
termed ME-7 hybridised with the human ALK-3 probe.
However, nucleotide sequencing revealed that this clone was
incomplete, and lacked the 5' part of the translated
region. Screening the same cDNA library with a probe
corresponding to the extracellular domain of human ALK-3
(nucleotides 79-824) revealed the clone ME-D. This clone
was isolated and the sequence was analyzed. Although this
clone was incomplete in the 3' end of the translated
region, ME-7 and ME-D overlapped and together covered the
complete sequence of mouse ALK-3. The predicted amino-acid
sequence of mouse ALK-3 is very similar to the human
sequence; only 8 amino-acid residues differ (98% identity;
see SEQ ID No. 14) and the calculated molecular mass of the
primary translated product without the putative signal
sequence is 57,447 Da.
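
The identity figure can be checked from the number of differing residues. The sketch below assumes that mouse and human ALK-3 are compared over the 532 residues reported for the human protein, with no gaps; that assumption, the function names and the toy peptides are illustrative.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of identical positions between two aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


def identity_from_differences(length: int, differences: int) -> float:
    """Percentage identity given a length and a count of mismatches."""
    return 100.0 * (length - differences) / length


# Toy aligned peptides differing at one of five positions.
print(percent_identity("MKTAY", "MKSAY"))            # 80.0
# 8 differences over an assumed 532 aligned residues.
print(round(identity_from_differences(532, 8), 1))   # 98.5
```

The result, about 98.5%, is consistent with the 98% identity quoted in the text.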

Of the clones obtained from the initial library screening
with PCR product 11.1, four clones hybridized to the probe
corresponding to the conserved kinase domain of ALK-4 but
not to probes from more divergent parts of ALK-1 to -4.
Analysis of these clones revealed that they have an
identical sequence which differs from those of ALK-1 to -5
and was termed ALK-6. The longest clone, ME6, with a 2.0 kb
insert was completely sequenced, yielding a 1952 bp
fragment consisting of an open reading frame of 1506 bp
(502 amino-acids), flanked by a 5' untranslated sequence of
186 bp, and a 3' untranslated sequence of 160 bp. The
nucleotide and predicted amino-acid sequences of mouse
ALK-6 are shown in SEQ ID Nos. 17 and 18. No
polyadenylation signal was found in the 3' untranslated
region of ME6, indicating that the cDNA was internally
primed in the 3' end. Only one ATG codon was found in the
5' part of the open reading frame, which fulfils the rules
for translation initiation (Kozak, supra), and was preceded
by an in-frame stop codon at nucleotides 163-165. However,
a typical hydrophobic leader sequence was not observed at
the N terminus of the translated region. Since there is no
other ATG codon and putative hydrophobic leader sequence,
this ATG codon is likely to be used as a translation
initiation site. The calculated molecular mass of the
primary translated product with the putative signal
sequence is 55,576 Da.
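
The lengths quoted for the ALK-6 clone can be cross-checked with simple arithmetic. The sketch below uses only the figures printed above; the variable and function names are hypothetical. Taken at face value, the three segment lengths sum to 1852 bp rather than the stated 1952 bp. The sketch only reports that difference and does not attempt to say which figure is affected.

```python
# Figures as printed in the paragraph above.
RESIDUES = 502
ORF_BP = 1506
UTR5_BP = 186
UTR3_BP = 160
REPORTED_TOTAL_BP = 1952
UPSTREAM_STOP_START = 163  # first nucleotide of the in-frame stop codon


def codons_to_bp(residues: int, include_stop: bool = False) -> int:
    """Length in base pairs of a coding region with the given residue count."""
    return 3 * (residues + (1 if include_stop else 0))


def first_orf_nucleotide(utr5_bp: int) -> int:
    """1-based position of the first coding nucleotide after the 5' UTR."""
    return utr5_bp + 1


segments_total = UTR5_BP + ORF_BP + UTR3_BP

if __name__ == "__main__":
    print("ORF matches residue count:", codons_to_bp(RESIDUES) == ORF_BP)
    in_frame = (first_orf_nucleotide(UTR5_BP) - UPSTREAM_STOP_START) % 3 == 0
    print("upstream stop codon is in frame:", in_frame)
    print("sum of segments:", segments_total, "reported:", REPORTED_TOTAL_BP)
```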

Mouse ALK-1 (clone AM6 with 1.9 kb insert) was obtained
from the mouse placenta λZAPII cDNA library using human
ALK-1 cDNA as a probe (see SEQ ID No. 11). Mouse ALK-4
(clone 8a1 with 2.3 kb insert) was also obtained from this
library using human ALK-4 cDNA as a probe (SEQ ID No. 15).

To summarise, clones HP22, HP57, ONF1, ONF3, ONF4 and HP29
encode the same gene, ALK-1. Clone AM6 encodes mouse ALK-1.
HP53, HP64 and HP84 encode the same gene, ALK-2. ONF5, ONF2
and ON11 encode the same gene, ALK-3. ME-7 and ME-D encode
the mouse counterpart of human ALK-3. 11B8 encodes a
different gene, ALK-4, whilst 8a1 encodes the mouse
equivalent. EMBLA encodes ALK-5, and ME-6 encodes ALK-6.

The sequence alignment between the 6 ALK genes and TβR-II,
mActR-II and ActR-IIB is shown in Figure 3. These molecules
have a similar domain structure: an N-terminal predicted
hydrophobic signal sequence (von Heijne (1986) Nucl. Acids
Res. 14, 4683-4690) is followed by a relatively small
extracellular cysteine-rich ligand-binding domain, a single
hydrophobic transmembrane region (Kyte & Doolittle (1982)
J. Mol. Biol. 157, 105-132) and a C-terminal intracellular
portion, which consists almost entirely of a kinase domain
(Figures 3 and 4).

The extracellular domains of these receptors have
cysteine-rich regions, but they show little sequence
similarity; for example, less than 20% sequence identity is
found between Daf-1, ActR-II, TβR-II and ALK-5. The ALKs
appear to form a subfamily as they show higher sequence
similarities (15-47% identity) in their extracellular
domains. The extracellular domains of ALK-5 and ALK-4 have
about 29% sequence identity. In addition, ALK-3 and ALK-6
share a high degree of sequence similarity in their
extracellular domains (46% identity).
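
Percent identity figures of the kind quoted above are computed from a pairwise alignment. The following is a minimal sketch, assuming the two sequences are already aligned to equal length and that columns containing a gap are excluded from the count. Alignment programs differ in how they choose the denominator, so this convention is an assumption, not the method used for the figures in the text.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the ungapped columns of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


if __name__ == "__main__":
    # Toy aligned fragments, for illustration only.
    print(round(percent_identity("ACDE-FG", "ACDQWFG"), 1))
```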

The positions of many of the cysteine residues in all
receptors can be aligned, suggesting that the extracellular
domains may adopt a similar structural configuration. See
Figure 5 for ALKs -1, -2, -3 and -5. Each of the ALKs
(except ALK-6) has a potential N-linked glycosylation site,
the position of which is conserved between ALK-1 and ALK-2,
and between ALK-3, ALK-4 and ALK-5 (see Figure 4).
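
Potential N-linked glycosylation sites are conventionally located by scanning for the sequon Asn-X-Ser/Thr, where X is any residue except proline. The sketch below illustrates that scan on a toy sequence. It is an illustration of the general convention, not the procedure used to prepare the Figures, and the function name is hypothetical.

```python
import re

# Asn, then any residue except Pro, then Ser or Thr. The lookahead lets
# overlapping sequons be reported as separate sites.
SEQUON = re.compile(r"(?=(N[^P][ST]))")


def find_sequons(sequence: str) -> list[tuple[int, str]]:
    """Return (1-based Asn position, tripeptide) for each potential site."""
    return [(m.start() + 1, m.group(1)) for m in SEQUON.finditer(sequence.upper())]


if __name__ == "__main__":
    toy = "MANGTLNPSAANVS"  # hypothetical sequence, for illustration only
    for position, tripeptide in find_sequons(toy):
        print(position, tripeptide)
```

A match marks only a potential acceptor site; whether it is used has to be shown experimentally, for example by the endoglycosidase F digestion described later.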

The sequence similarities in the kinase domains between
Daf-1, ActR-II, TβR-II and ALK-5 are approximately 40%,
whereas the sequence similarity between the ALKs 1 to 6 is
higher (between 59% and 90%; see Figure 6). Pairwise
comparison using the Jotun-Hein sequence alignment program
(Hein (1990) Meth. Enzymol. 183, 626-645), between all
family members, identifies the ALKs as a separate subclass
among serine/threonine kinases (Figure 7).

The catalytic domains of kinases can be divided into 12
subdomains with stretches of conserved amino-acid residues.
The key motifs are found in serine/threonine kinase
receptors, suggesting that they are functional kinases. The
consensus sequence for the binding of ATP
(Gly-X-Gly-X-X-Gly in subdomain I followed by a Lys residue
further downstream in subdomain II) is found in all the
ALKs.
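
The ATP-binding consensus described above can be searched for with a short pattern match. This is a minimal sketch: the window within which the downstream Lys is sought (`max_gap`) is a hypothetical parameter chosen for illustration, since the text says only "further downstream", and the toy sequence is not taken from the sequence listing.

```python
import re

# Gly-X-Gly-X-X-Gly, where X is any residue.
GLY_LOOP = re.compile(r"G.G..G")


def find_atp_sites(sequence: str, max_gap: int = 30) -> list[tuple[int, int]]:
    """Return (motif start, Lys position), both 1-based, for each Gly-rich
    motif that is followed by a Lys within max_gap residues."""
    seq = sequence.upper()
    hits = []
    for match in GLY_LOOP.finditer(seq):
        window = seq[match.end():match.end() + max_gap]
        offset = window.find("K")
        if offset != -1:
            hits.append((match.start() + 1, match.end() + offset + 1))
    return hits


if __name__ == "__main__":
    toy = "AAGKGRFGEVWAAAAVAVKIF"  # hypothetical sequence, for illustration
    print(find_atp_sites(toy))
```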

The kinase domains of ActR-II and ALKs show approximately
equal sequence similarity with tyrosine and
serine/threonine protein kinases. However, analysis of the
amino-acid sequences in subdomains VI and VIII, which are
the most useful to distinguish a specificity for
phosphorylation of tyrosine residues versus
serine/threonine residues (Hanks et al. (1988) Science 241,
42-52) indicates that these kinases are serine/threonine
kinases; refer to Table 2.

The sequence motifs DLKSKN (Subdomain VIB) and GTKRYM
(Subdomain VIII), which are found in most of the
serine/threonine kinase receptors, agree well with the
consensus sequences for all protein serine/threonine kinase
receptors in these regions. In addition, these receptors,
except for ALK-1, do not have a tyrosine residue surrounded
by acidic residues between subdomains VII and VIII, which
is common for tyrosine kinases. A unique characteristic of
the members of the ALK serine/threonine kinase receptor
family is the presence of two short inserts in the kinase
domain between subdomains VIA and VIB and between
subdomains X and XI. In the intracellular domain, these
regions, together with the juxtamembrane part and
C-terminal tail, are the most divergent between family
members (see Figures 3 and 4). Based on the sequence
similarity with the type II receptors for TGF-β and
activin, the C termini of the kinase domains of ALKs -1 to
-6 are set at Ser-495, Ser-501, Ser-527, Gln-500, Gln-498
and Ser-497, respectively.

mRNA Expression

The distribution of ALK-1, -2, -3, -4 was determined by
Northern blot analysis. A Northern blot filter with mRNAs
from different human tissues was obtained from Clontech
(Palo Alto, CA). The filters were hybridized with
32P-labelled probes at 42°C overnight in 50% formaldehyde,
5 x standard saline citrate (SSC; 1 x SSC is 50 mM sodium
citrate, pH 7.0, 150 mM NaCl), 0.1% SDS, 50 mM sodium
phosphate, 5 x Denhardt's solution and 0.1 mg/ml salmon
sperm DNA. In order to minimize cross-hybridization, probes
were used that did not encode part of the kinase domains,
but corresponded to the highly diverged sequences of either
5' untranslated and ligand-binding regions (probes for
ALK-1, -2 and -3) or 3' untranslated sequences (probe for
ALK-4). The probes were labelled by random priming using
the Multiprime (or Mega-prime) DNA labelling system and
[α-32P]dCTP (Feinberg & Vogelstein (1983) Anal. Biochem.
132, 6-13). Unincorporated label was removed by Sephadex
G-25 chromatography. Filters were washed at 65°C, twice for
30 minutes in 2.5 x SSC, 0.1% SDS and twice for 30 minutes
in 0.3 x SSC, 0.1% SDS before being exposed to X-ray film.
Stripping of blots was performed by incubation at 90-100°C
in water for 20 minutes.

The ALK-5 mRNA size and distribution were determined by
Northern blot analysis as above. An EcoRI fragment of
980 bp of the full-length ALK-5 cDNA clone, corresponding
to the C-terminal part of the kinase domain and 3'
untranslated region (nucleotides 1259-2232 in SEQ ID No. 9)
was used as a probe. The filter was washed twice in 0.5 x
SSC, 0.1% SDS at 55°C for 15 minutes.

Using the probe for ALK-1, two transcripts of 2.2 and
4.9 kb were detected. The ALK-1 expression level varied
strongly between different tissues: high in placenta and
lung, moderate in heart, muscle and kidney, and low (to not
detectable) in brain, liver and pancreas. The relative
ratios between the two transcripts were similar in most
tissues; in kidney, however, there was relatively more of
the 4.9 kb transcript. By reprobing the blot with a probe
for ALK-2, one transcript of 4.0 kb was detected with a
ubiquitous expression pattern. Expression was detected in
every tissue investigated and was highest in placenta and
skeletal muscle. Subsequently the blot was reprobed for
ALK-3. One major transcript of 4.4 kb and a minor
transcript of 7.9 kb were detected. Expression was high in
skeletal muscle, in which an additional minor transcript of
10 kb was also observed. Moderate levels of ALK-3 mRNA were
detected in heart, placenta, kidney and pancreas, and low
(to not detectable) expression was found in brain, lung and
liver. The relative ratios between the different
transcripts were similar in the tested tissues, the 4.4 kb
transcript being the predominant one, with the exception of
brain, where both transcripts were expressed at a similar
level. Probing the blot with ALK-4 indicated the presence
of a transcript with the estimated size of 5.2 kb and
revealed a ubiquitous expression pattern. The results of
Northern blot analysis using the probe for ALK-5 showed
that a 5.5 kb transcript is expressed in all human tissues
tested, being most abundant in placenta and least abundant
in brain and heart.

The distribution of mRNA for mouse ALK-3 and -6 in various
mouse tissues was also determined by Northern blot
analysis. A multiple mouse tissue blot was obtained from
Clontech, Palo Alto, California, U.S.A. The filter was
hybridized as described above with probes for mouse ALK-3
and ALK-6. The EcoRI-PstI restriction fragment,
corresponding to nucleotides 79-1100 of ALK-3, and the
SacI-HpaI fragment, corresponding to nucleotides 57-720 of
ALK-6, were used as probes. The filter was washed at 65°C
twice for 30 minutes in 2.5 x SSC, 0.1% SDS and twice for
30 minutes with 0.3 x SSC, 0.1% SDS and then subjected to
autoradiography.

Using the probe for mouse ALK-3, a 1.1 kb transcript 
was found only in spleen. By reprobing the blot with the 
ALK-6 specific probe, a transcript of 7.2 kb was found in 
brain and a weak signal was also seen in lung. No other 
signal was seen in the other tissues tested, i.e. heart,
liver, skeletal muscle, kidney and testis. 

All detected transcript sizes were different, and thus no
cross-reaction between mRNAs for the different ALKs was
observed when the specific probes were used. This suggests
that the multiple transcripts of ALK-1 and ALK-3 are coded
from the same gene. The mechanism for generation of the
different transcripts is unknown at present; they may be
formed by alternative mRNA splicing, differential
polyadenylation, use of different promoters, or by a
combination of these events. Differences in mRNA splicing
in the regions coding for the extracellular domains may
lead to the synthesis of receptors with different
affinities for ligands, as was shown for mActR-IIB
(Attisano et al. (1992) Cell 68, 97-108) or to the
production of soluble binding protein.
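
The statement that all detected transcript sizes were different can be checked by comparing the sizes reported for the human tissue blot pairwise. In the sketch below the sizes are those given in the text, while the tolerance parameter is a hypothetical stand-in for gel resolution, which the text does not specify.

```python
from itertools import combinations

# Transcript sizes (kb) reported above for the human tissue blot.
HUMAN_TRANSCRIPTS_KB = {
    "ALK-1": [2.2, 4.9],
    "ALK-2": [4.0],
    "ALK-3": [4.4, 7.9, 10.0],
    "ALK-4": [5.2],
    "ALK-5": [5.5],
}


def shared_sizes(transcripts: dict[str, list[float]], tolerance_kb: float = 0.0):
    """List (probe_a, probe_b, size_a, size_b) for transcripts of different
    probes whose sizes differ by no more than tolerance_kb."""
    clashes = []
    for (probe_a, sizes_a), (probe_b, sizes_b) in combinations(transcripts.items(), 2):
        for size_a in sizes_a:
            for size_b in sizes_b:
                if abs(size_a - size_b) <= tolerance_kb:
                    clashes.append((probe_a, probe_b, size_a, size_b))
    return clashes


if __name__ == "__main__":
    print(shared_sizes(HUMAN_TRANSCRIPTS_KB))
    print(shared_sizes(HUMAN_TRANSCRIPTS_KB, tolerance_kb=0.35))
```

With zero tolerance no two probes share a transcript size. A non-zero tolerance shows which pairs lie closest together and would be the first to be confused on a lower-resolution gel.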

The above experiments describe the isolation of 
nucleic acid sequences coding for a new family of human
receptor kinases. The cDNA for ALK-5 was then used to 
determine the encoded protein size and binding properties. 
Properties of the ALKs cDNA Encoded Proteins

To study the properties of the proteins encoded by the
different ALK cDNAs, the cDNA for each ALK was subcloned
into a eukaryotic expression vector and transfected into
various cell types and then subjected to
immunoprecipitation using a rabbit antiserum raised against
a synthetic peptide corresponding to part of the
intracellular juxtamembrane region. This region is
divergent in sequence between the various serine/threonine
kinase receptors. The following amino-acid residues were
used:

ALK-1    145-166
ALK-2    151-172
ALK-3    181-202
ALK-4    153-171
ALK-5    158-179
ALK-6    151-168
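
The lengths of the immunizing peptides follow directly from the residue ranges listed above. A minimal sketch, assuming the ranges are inclusive at both ends; the names are hypothetical.

```python
# Residue ranges of the synthetic peptides, as listed above (inclusive).
PEPTIDE_RANGES = {
    "ALK-1": (145, 166),
    "ALK-2": (151, 172),
    "ALK-3": (181, 202),
    "ALK-4": (153, 171),
    "ALK-5": (158, 179),
    "ALK-6": (151, 168),
}


def peptide_length(first: int, last: int) -> int:
    """Number of residues in an inclusive range first..last."""
    return last - first + 1


PEPTIDE_LENGTHS = {
    name: peptide_length(first, last)
    for name, (first, last) in PEPTIDE_RANGES.items()
}

if __name__ == "__main__":
    for name, length in PEPTIDE_LENGTHS.items():
        print(name, length)
```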

The rabbit antiserum against ALK-5 was designated VPN.

The peptides were synthesized with an Applied Biosystems
430A Peptide Synthesizer using t-butoxycarbonyl chemistry
and purified by reversed-phase high performance liquid
chromatography. The peptides were coupled to keyhole limpet
haemocyanin (Calbiochem-Behring) using glutaraldehyde, as
described by Gullick et al. (1985) EMBO J. 4, 2869-2877.
The coupled peptides were mixed with Freund's adjuvant and
used to immunize rabbits.
Transient transfection of the ALK-5 cDNA

COS-1 cells (American Type Culture Collection) and the R
mutant of Mv1Lu cells (for references, see below) were
cultured in Dulbecco's modified Eagle's medium containing
10% fetal bovine serum (FBS), 100 units/ml penicillin and
50 μg/ml streptomycin in a 5% CO2 atmosphere at 37°C. The
ALK-5 cDNA (nucleotides (-76) - 2232), which includes the
complete coding region, was cloned in the pSV7d vector
(Truett et al. (1985) DNA 4, 333-349), and used for
transfection. Transfection into COS-1 cells was performed
by the calcium phosphate precipitation method (Wigler et
al. (1979) Cell 16, 777-785). Briefly, cells were seeded
into 6-well cell culture plates at a density of 5 x 10^5
cells/well, and transfected the following day with 10 μg of
recombinant plasmid. After overnight incubation, cells were
washed three times with a buffer containing 25 mM Tris-HCl,
pH 7.4, 138 mM NaCl, 5 mM KCl, 0.7 mM CaCl2, 0.5 mM MgCl2
and 0.6 mM Na2HPO4, and then incubated with Dulbecco's
modified Eagle's medium containing FBS and antibiotics. Two
days after transfection, the cells were metabolically
labelled by incubating the cells for 6 hours in methionine
and cysteine-free MCDB 104 medium with 150 μCi/ml of
[35S]-methionine and [35S]-cysteine (in vivo labelling mix;
Amersham). After labelling, the cells were washed with
150 mM NaCl, 25 mM Tris-HCl, pH 7.4, and then solubilized
with a buffer containing 20 mM Tris-HCl, pH 7.4, 150 mM
NaCl, 10 mM EDTA, 1% Triton X-100, 1% deoxycholate, 1.5%
Trasylol (Bayer) and 1 mM phenylmethylsulfonylfluoride
(PMSF; Sigma). After 15 minutes on ice, the cell lysates
were pelleted by centrifugation, and the supernatants were
then incubated with 7 μl of preimmune serum for 1.5 hours
at 4°C. Samples were then given 50 μl of protein
A-Sepharose (Pharmacia-LKB) slurry (50% packed beads in
150 mM NaCl, 20 mM Tris-HCl, pH 7.4, 0.2% Triton X-100) and
incubated for 45 minutes at 4°C. The beads were spun down
by centrifugation, and the supernatants (1 ml) were then
incubated with either 7 μl of preimmune serum or the VPN
antiserum for 1.5 hours at 4°C. For blocking, 10 μg of
peptide was added together with the antiserum. Immune
complexes were then given 50 μl of protein A-Sepharose
(Pharmacia-LKB) slurry (50% packed beads in 150 mM NaCl,
20 mM Tris-HCl, pH 7.4, 0.2% Triton X-100) and incubated
for 45 minutes at 4°C. The beads were spun down and washed
four times with a washing buffer (20 mM Tris-HCl, pH 7.4,
500 mM NaCl, 1% Triton X-100, 1% deoxycholate and 0.2%
SDS), followed by one wash in distilled water. The immune
complexes were eluted by boiling for 5 minutes in the
SDS-sample buffer (100 mM Tris-HCl, pH 8.8, 0.01%
bromophenol blue, 36% glycerol, 4% SDS) in the presence of
10 mM DTT, and analyzed by SDS-gel electrophoresis using
7-15% polyacrylamide gels (Blobel and Dobberstein (1975)
J. Cell Biol. 67, 835-851). Gels were fixed, incubated with
Amplify (Amersham) for 20 minutes, and subjected to
fluorography. A component of 53 kDa was seen. This
component was not seen when preimmune serum was used, or
when 10 μg blocking peptide was added together with the
antiserum. Moreover, it was not detectable in samples
derived from untransfected COS-1 cells using either
preimmune serum or the antiserum.
Digestion with Endoglycosidase F

Samples immunoprecipitated with the VPN antisera obtained
as described above were incubated with 0.5 U of
endoglycosidase F (Boehringer Mannheim Biochemica) in a
buffer containing 100 mM sodium phosphate, pH 6.1, 50 mM
EDTA, 1% Triton X-100, 0.1% SDS and 1% β-mercaptoethanol at
37°C for 24 hours. Samples were eluted by boiling for 5
minutes in the SDS-sample buffer, and analyzed by
SDS-polyacrylamide gel electrophoresis as described above.
Hydrolysis of N-linked carbohydrates by endoglycosidase F
shifted the 53 kDa band to 51 kDa. The extracellular domain
of ALK-5 contains one potential acceptor site for
N-glycosylation and the size of the deglycosylated protein
is close to the predicted size of the core protein.
Establishment of PAE Cell Lines Expressing ALK-5

In order to investigate whether the ALK-5 cDNA encodes a
receptor for TGF-β, porcine aortic endothelial (PAE) cells
were transfected with an expression vector containing the
ALK-5 cDNA, and analyzed for the binding of 125I-TGF-β1.

PAE cells were cultured in Ham's F-12 medium supplemented
with 10% FBS and antibiotics (Miyazono et al. (1988)
J. Biol. Chem. 263, 6407-6415). The ALK-5 cDNA was cloned
into the cytomegalovirus (CMV)-based expression vector
pcDNA I/NEO (Invitrogen), and transfected into PAE cells by
electroporation. After 48 hours, selection was initiated by
adding Geneticin (G418 sulphate; Gibco-BRL) to the culture
medium at a final concentration of 0.5 mg/ml (Westermark et
al. (1990) Proc. Natl. Acad. Sci. USA 87, 128-132). Several
clones were obtained, and after analysis by
immunoprecipitation using the VPN antiserum, one clone
denoted PAE/TβR-I was chosen and further analyzed.
Iodination of TGF-β1, Binding and Affinity Crosslinking

Recombinant human TGF-β1 was iodinated using the chloramine
T method according to Frolik et al. (1984) J. Biol. Chem.
259, 10995-11000. Cross-linking experiments were performed
as previously described (Ichijo et al. (1990) Exp. Cell
Res. 187, 263-269). Briefly, cells in 6-well plates were
washed with binding buffer (phosphate-buffered saline
containing 0.9 mM CaCl2, 0.49 mM MgCl2 and 1 mg/ml bovine
serum albumin (BSA)), and incubated on ice in the same
buffer with 125I-TGF-β1 in the presence or absence of
excess unlabelled TGF-β1 for 3 hours. Cells were washed and
cross-linking was done in the binding buffer without BSA
together with 0.28 mM disuccinimidyl suberate (DSS; Pierce
Chemical Co.) for 15 minutes on ice. The cells were
harvested by the addition of 1 ml of detachment buffer
(10 mM Tris-HCl, pH 7.4, 1 mM EDTA, 10% glycerol, 0.3 mM
PMSF). The cells were pelleted by centrifugation, then
resuspended in 50 μl of solubilization buffer (125 mM NaCl,
10 mM Tris-HCl, pH 7.4, 1 mM EDTA, 1% Triton X-100, 0.3 mM
PMSF, 1% Trasylol) and incubated for 40 minutes on ice.
Cells were centrifuged again and supernatants were
subjected to analysis by SDS-gel electrophoresis using
4-15% polyacrylamide gels, followed by autoradiography.
125I-TGF-β1 formed a 70 kDa cross-linked complex in the
transfected PAE cells (PAE/TβR-I cells). The size of this
complex was very similar to that of the TGF-β type I
receptor complex observed at lower amounts in the
untransfected cells. A concomitant increase of 94 kDa TGF-β
type II receptor complex could also be observed in the
PAE/TβR-I cells. Components of 150-190 kDa, which may
represent crosslinked complexes between the type I and type
II receptors, were also observed in the PAE/TβR-I cells.

In order to determine whether the cross-linked 70 kDa
complex contained the protein encoded by the ALK-5 cDNA,
the affinity cross-linking was followed by
immunoprecipitation using the VPN antiserum. For this,
cells in 25 cm2 flasks were used. The supernatants obtained
after cross-linking were incubated with 7 μl of preimmune
serum or VPN antiserum in the presence or absence of 10 μg
of peptide for 1.5 h at 4°C. Immune complexes were then
added to 50 μl of protein A-Sepharose slurry and incubated
for 45 minutes at 4°C. The protein A-Sepharose beads were
washed four times with the washing buffer, once with
distilled water, and the samples were analysed by SDS-gel
electrophoresis using 4-15% polyacrylamide gradient gels
and autoradiography. A 70 kDa cross-linked complex was
precipitated by the VPN antiserum in PAE/TβR-I cells, and a
weaker band of the same size was also seen in the
untransfected cells, indicating that the untransfected PAE
cells contained a low amount of endogenous ALK-5. The
70 kDa complex was not observed when preimmune serum was
used, or when immune serum was blocked by 10 μg of peptide.
Moreover, a coprecipitated 94 kDa component could also be
observed in the PAE/TβR-I cells. The latter component is
likely to represent a TGF-β type II receptor complex, since
an antiserum, termed DRL, which was raised against a
synthetic peptide from the C-terminal part of the TGF-β
type II receptor, precipitated a 94 kDa TGF-β type II
receptor complex, as well as a 70 kDa type I receptor
complex from PAE/TβR-I cells.

The carbohydrate contents of ALK-5 and the TGF-β type II
receptor were characterised by deglycosylation using
endoglycosidase F as described above and analyzed by
SDS-polyacrylamide gel electrophoresis and autoradiography.
The ALK-5 cross-linked complex shifted from 70 kDa to
66 kDa, whereas that of the type II receptor shifted from
94 kDa to 82 kDa. The observed larger shift of the type II
receptor band compared with that of the ALK-5 band is
consistent with the deglycosylation data of the type I and
type II receptors on rat liver cells reported previously
(Cheifetz et al. (1988) J. Biol. Chem. 263, 16984-16991),
and fits well with the fact that the porcine TGF-β type II
receptor has two N-glycosylation sites (Lin et al. (1992)
Cell 68, 775-785), whereas ALK-5 has only one (see SEQ ID
No. 9).
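
The band shifts reported for the deglycosylation experiments can be tabulated as follows. The band sizes and site counts are those given in the text. Dividing a shift by the number of N-glycosylation sites is an illustrative assumption made here, not an analysis reported in the source, and gel-estimated sizes are approximate.

```python
# Apparent sizes (kDa) before and after endoglycosidase F digestion.
BANDS_KDA = {
    "ALK-5, metabolically labelled": (53.0, 51.0),
    "ALK-5 cross-linked complex": (70.0, 66.0),
    "type II receptor cross-linked complex": (94.0, 82.0),
}

# Number of N-glycosylation sites stated in the text.
N_GLYCOSYLATION_SITES = {
    "ALK-5 cross-linked complex": 1,
    "type II receptor cross-linked complex": 2,
}


def shift_kda(before: float, after: float) -> float:
    """Decrease in apparent size on deglycosylation."""
    return before - after


def shift_per_site(label: str) -> float:
    """Shift divided by the stated site count (illustrative assumption)."""
    before, after = BANDS_KDA[label]
    return shift_kda(before, after) / N_GLYCOSYLATION_SITES[label]


if __name__ == "__main__":
    for label, (before, after) in BANDS_KDA.items():
        print(label, shift_kda(before, after))
```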

Binding of TGF-β1 to the type I receptor is known to be
abolished by transient treatment of the cells with
dithiothreitol (DTT) (Cheifetz and Massagué (1991) J. Biol.
Chem. 266, 20767-20772; Wrana et al. (1992) Cell 71,
1003-1014). When analyzed by affinity cross-linking,
binding of 125I-TGF-β1 to ALK-5, but not to the type II
receptor, was completely abolished by DTT treatment of
PAE/TβR-I cells. Affinity cross-linking followed by
immunoprecipitation by the VPN antiserum showed that
neither the ALK-5 nor the type II receptor complexes was
precipitated after DTT treatment, indicating that the VPN
antiserum reacts only with ALK-5. The data show that the
VPN antiserum recognizes a TGF-β type I receptor, and that
the type I and type II receptors form a heteromeric
complex.
125I-TGF-β1 Binding and Affinity Crosslinking of Transfected
COS Cells

Transient expression plasmids of ALKs -1 to -6 and TβR-II
were generated by subcloning into the pSV7d expression
vector or into the pcDNA I expression vector (Invitrogen).
Transient transfection of COS-1 cells and iodination of
TGF-β1 were carried out as described above. Crosslinking
and immunoprecipitation were performed as described for PAE
cells above.

Transfection of cDNAs for ALKs into COS-1 cells did not
show any appreciable binding of 125I-TGF-β1, consistent
with the observation that type I receptors do not bind
TGF-β in the absence of type II receptors. When the TβR-II
cDNA was co-transfected with cDNAs for the different ALKs,
type I receptor-like complexes were seen, at different
levels, in each case. COS-1 cells transfected with TβR-II
and ALK cDNAs were analyzed by affinity crosslinking
followed by immunoprecipitation using the DRL antisera or
specific antisera against ALKs. Each one of the ALKs bound
125I-TGF-β1 and was coimmunoprecipitated with the TβR-II
complex using the DRL antiserum. Comparison of the
efficiency of the different ALKs to form heteromeric
complexes with TβR-II revealed that ALK-5 formed such
complexes more efficiently than the other ALKs. The size of
the crosslinked complex was larger for ALK-3 than for other
ALKs, consistent with its slightly larger size.
Expression of the ALK Protein in Different Cell Types

Two different approaches were used to elucidate which ALKs
are physiological type I receptors for TGF-β.

Firstly, several cell lines were tested for the expression
of the ALK proteins by cross-linking followed by
immunoprecipitation using the specific antisera against
ALKs and the TGF-β type II receptor. The mink lung
epithelial cell line, Mv1Lu, is widely used to provide
target cells for TGF-β action and is well characterized
regarding TGF-β receptors (Laiho et al. (1990) J. Biol.
Chem. 265, 18518-18524; Laiho et al. (1991) J. Biol. Chem.
266, 9108-9112). Only the VPN antiserum efficiently
precipitated both type I and type II TGF-β receptors in the
wild-type Mv1Lu cells. The DRL antiserum also precipitated
components with the same size as those precipitated by the
VPN antiserum. A mutant cell line (R mutant) which lacks
the TGF-β type I receptor and does not respond to TGF-β
(Laiho et al., supra) was also investigated by
cross-linking followed by immunoprecipitation. Consistent
with the results obtained by Laiho et al. (1990), supra,
the type III and type II TGF-β receptor complexes, but not
the type I receptor complex, were observed by affinity
crosslinking. Crosslinking followed by immunoprecipitation
using the DRL antiserum revealed only the type II receptor
complex, whereas neither the type I nor type II receptor
complexes was seen using the VPN antiserum. When the cells
were metabolically labelled and subjected to
immunoprecipitation using the VPN antiserum, the 53 kDa
ALK-5 protein was precipitated in both the wild-type and R
mutant Mv1Lu cells. These results suggest that the type I
receptor expressed in the R mutant is ALK-5, which has lost
the affinity for binding to TGF-β after mutation.

The type I and type II TGF-β receptor complexes could be
precipitated by the VPN and DRL antisera in other cell
lines, including human foreskin fibroblasts (AG1518), human
lung adenocarcinoma cells (A549), and human oral squamous
cell carcinoma cells (HSC-2). Affinity cross-linking
studies revealed multiple TGF-β type I receptor-like
complexes of 70-77 kDa in these cells. These components
were less efficiently competed by excess unlabelled TGF-β1
in HSC-2 cells. Moreover, the type II receptor complex was
low or not detectable in A549 and HSC-2 cells.
Cross-linking followed by immunoprecipitation revealed that
the VPN antiserum precipitated only the 70 kDa complex
among the 70-77 kDa components. The DRL antiserum
precipitated the 94 kDa type II receptor complex as well as
the 70 kDa type I receptor complex in these cells, but not
the putative type I receptor complexes of slightly larger
sizes. These results suggest that multiple type I TGF-β
receptors may exist and that the 70 kDa complex containing
ALK-5 forms a heteromeric complex with the TGF-β type II
receptor cloned by Lin et al. (1992) Cell 68, 775-785, more
efficiently than the other species. In rat pheochromocytoma
cells (PC12), which have been reported to have no TGF-β
receptor complexes by affinity cross-linking (Massagué et
al. (1990) Ann. N.Y. Acad. Sci. 593, 59-72), neither VPN
nor DRL antisera precipitated the TGF-β receptor complexes.
The antisera against ALKs -1 to -4 and ALK-6 did not
efficiently immunoprecipitate the crosslinked receptor
complexes in porcine aortic endothelial (PAE) cells or
human foreskin fibroblasts.

Next, it was investigated whether ALKs could restore
responsiveness to TGF-β in the R mutant of Mv1Lu cells,
which lack the ligand-binding ability of the TGF-β type I
receptor but have intact type II receptor. Wild-type Mv1Lu
cells and mutant cells were transfected with ALK cDNA and
were then assayed for the production of plasminogen
activator inhibitor-1 (PAI-1), which is produced as a
result of TGF-β receptor activation, as described
previously by Laiho et al. (1991) Mol. Cell. Biol. 11,
972-978. Briefly, cells were incubated with or without
10 ng/ml of TGF-β1 for 2 hours in serum-free MCDB 104
without methionine. Thereafter, cultures were labelled with
[35S]methionine (40 μCi/ml) for 2 hours. The cells were
removed by washing on ice once in PBS, twice in 10 mM
Tris-HCl (pH 8.0), 0.5% sodium deoxycholate, 1 mM PMSF,
twice in 2 mM Tris-HCl (pH 8.0), and once in PBS.
Extracellular matrix proteins were extracted by scraping
cells into the SDS-sample buffer containing DTT, and
analyzed by SDS-gel electrophoresis followed by
fluorography using Amplify. PAI-1 can be identified as a
characteristic 45 kDa band (Laiho et al. (1991) Mol. Cell.
Biol. 11, 972-978). Wild-type Mv1Lu cells responded to
TGF-β and produced PAI-1, whereas the R mutant clone did
not, even after stimulation by TGF-β1. Transient
transfection of the ALK-5 cDNA into the R mutant clone led
to the production of PAI-1 in response to the stimulation
by TGF-β1, indicating that the ALK-5 cDNA encodes a
functional TGF-β type I receptor. In contrast, the R mutant
cells that were transfected with other ALKs did not produce
PAI-1 upon the addition of TGF-β1.

Using similar approaches as those described above for the
identification of TGF-β-binding ALKs, the ability of ALKs
to bind activin in the presence of ActR-II was examined.
COS-1 cells were co-transfected as described above.
Recombinant human activin A was iodinated using the
chloramine T method (Mathews and Vale (1991) Cell 65,
973-982). Transfected COS-1 cells were analysed for binding
and crosslinking of 125I-activin A in the presence or
absence of excess unlabelled activin A. The crosslinked
complexes were subjected to immunoprecipitation using DRL
antisera or specific ALK antisera.

All ALKs appear to bind activin A in the presence of
ActR-II. This is more clearly demonstrated by affinity
cross-linking followed by immunoprecipitation. ALK-2 and
ALK-4 bound 125I-activin A and were coimmunoprecipitated
with ActR-II. Other ALKs also bound 125I-activin A but with
a lower efficiency compared to ALK-2 and ALK-4.

In order to investigate whether ALKs are physiological
activin type I receptors, activin-responsive cells were
examined for the expression of endogenous activin type I
receptors. Mv1Lu cells, as well as the R mutant, express
both type I and type II receptors for activin, and the R
mutant cells produce PAI-1 upon the addition of activin A.
Mv1Lu cells were labeled with 125I-activin A, cross-linked
and immunoprecipitated by the antisera against ActR-II or
ALKs as described above.

The type I and type II receptor complexes in Mv1Lu cells
were immunoprecipitated only by the antisera against ALK-2,
ALK-4 and ActR-II. Similar results were obtained using the
R mutant cells. PAE cells do not bind activin because of
the lack of type II receptors for activin, and so cells
were transfected with a chimeric receptor, to enable them
to bind activin, as described herein. A plasmid (chim A)
containing the extracellular domain and C-terminal tail of
ActR-II (amino-acids -19 to 116 and 465 to 494,
respectively (Mathews and Vale (1991) Cell 65, 973-982))
and the kinase domain of TβR-II (amino-acids 160-543) (Lin
et al. (1992) Cell 68, 775-785) was constructed and
transfected into pcDNA/neo (Invitrogen). PAE cells were
stably transfected with the chim A plasmid by
electroporation, and cells expressing the chim A protein
were established as described previously. PAE/Chim A cells
were then subjected to 125I-activin A labelling,
crosslinking and immunoprecipitation as described above.

Similar to Mv1Lu cells, activin type I receptor 
complexes in PAE/Chim A cells were immunoprecipitated by 
the ALK-2 and ALK-4 antisera. These results show that both 
ALK-2 and ALK-4 serve as high affinity type I receptors for 
activin A in these cells. 

ALK-1, ALK-3 and ALK-6 bind TGF-β1 and activin A in 
the presence of their respective type II receptors, but the 
functional consequences of the binding of the ligands remain to be 
elucidated. 

The experiments described supra suggested further experiments. 
Specifically, it is known that TGF-β family members act as ligands 
in connection with specific type I and type II receptors, with 
resulting complexes interacting with members of the Smad family. 
See Heldin et al., Nature 390: 465-471 (1997), incorporated by 
reference. The Smad molecules are homologs of molecules found in 
Drosophila ("Mad") and C. elegans ("Sma"), hence the acronym 
"Smad". These are involved in signal transduction pathways 
downstream of serine/threonine kinase receptors. See Massagué et 
al., Trends Cell Biol. 7: 187-192 (1997). The different members of 
the family have different signalling roles. Smad1, for example, as 
well as Smad2 and Smad3, and perhaps Smad5, become phosphorylated via 
specific type I serine/threonine kinase receptors, and act in 
pathway-restricted fashion. For example, Xenopus Mad1 induces 
ventral mesoderm in the presence of BMP. The human Smad1 has been 
shown to have ventralizing activity. See Liu et al., Nature 381: 
620-623 (1996); Kretzschmar et al., Genes Dev. 11: 984-995 (1997). 
There is also some evidence that TGF-β induces phosphorylation of 
Smad1. See Lechleider et al., J. Biol. Chem. 271: 17617-17620 
(1996); Yingling et al., Proc. Natl. Acad. Sci. USA 93: 8940-8944 
(1996). Given what was known regarding this complex signalling 
pathway, the role of ALK-1 was studied. 

COS-7 cells, which do not express ALK-1, were transfected with 
cDNA encoding tagged ALK-1. The tag was hemagglutinin (hereafter 
"HA"), and a commercially available lipid-containing transfection 
agent was used. In parallel experiments, porcine aortic 
endothelial (PAE) cells were also used, because these cells express 
TGF-β type II receptors and ALK-5, but not ALK-1. Hence, PAE cells 
were either transfected, or not. Transfection protocols are given 
supra. 

The cells were then contacted with 125I-labelled TGF-β1, and 
then with ALK-1 specific antisera, to ascertain 
whether cross-linking had occurred. See the experiments supra, as 
well as ten Dijke et al., Science 264: 101-104 (1994), incorporated 
by reference. Antisera to ALK-5 were also used. 

The results indicated that the ALK-1 antiserum 
immunoprecipitated complexes of the appropriate size from the 
transfected COS-7 and PAE cells, but not from those which were not 
transfected, thereby establishing that ALK-1 is a receptor for 
TGF-β. 

This was confirmed in experiments on human umbilical vein 
endothelial cells (HUVEC). These cells are known to express ALK-1 
endogenously, as well as ALK-5. The ALK-5 antiserum and the ALK-1 
antiserum both immunoprecipitated type I and type II receptor 
cross-linked complexes. The band immunoprecipitated by the ALK-1 
antiserum migrated slightly more slowly than the band 
immunoprecipitated by the ALK-5 antiserum. This is in agreement 
with the difference in size of ALK-1 and ALK-5, and it indicates 
that both ALK-1 and ALK-5 bind TGF-β1 in HUVECs. 

Further, it shows that ALK-1 acts as a so-called "type I" TGF-β 
receptor in an endogenous, physiological setting. 

Once it was determined that TGF-β1 and ALK-1 interact, studies 
were carried out to determine whether or not activation of ALK-1 
resulted in phosphorylation of Smads. To test this, COS-7 cells 
were transfected in the same manner described supra with either 
Flag-tagged Smad1 or Flag-tagged Smad2 together with either a 
constitutively active form of ALK-1, or a constitutively active 
form of ALK-5. Specifically, the variant of ALK-1 is Q201D, and 
that of ALK-5 is T204D. Constitutively active ALK-1 was used to 
avoid the need for an additional transfection step. To elaborate, 
it is known that for the TGF-β pathway to function adequately, a 
complex of two type I receptors and two type II receptors must 
interact, so as to activate the receptors. Constitutively active 
receptors, such as those used herein, do not require the 
presence of the type II receptor to function. See Wieser et al., 
EMBO J. 14: 2199-2208 (1995). In order to determine if the 
resulting transfected cells produced phosphorylated Smads, Smads 
were detected using a Flag-specific antibody, which precipitated 
them, and phosphorylation was determined using the 
antiphosphoserine antibody of Nishimura et al., J. Biol. Chem. 273: 
1872-1879 (1998). It was determined, when the data were analyzed, 
that Smad1 was phosphorylated following interaction with activated 
ALK-1, but not following interaction of TGF-β and ALK-5. 
Conversely, the interaction of TGF-β and ALK-5 led to 
phosphorylation of Smad2, but not Smad1. This supports a 
conclusion that ALK-1 transduces signal in a manner similar to 
BMPs. 

Additional experiments were then carried out to study the 
interaction of ALK-1 with Smad1. Specifically, COS-7 cells were 
transfected with cDNA which encoded the wild type form of the TGF-β 
type II receptor (TβR-II), a kinase inactive form of ALK-1, and 
Flag-tagged Smad1. Kinase inactive ALK-1 was used, because the 
interaction of Smad1 and receptors is known to be transient, as 
once Smads are phosphorylated they dissociate from the type I 
receptor. See Macias-Silva et al., Cell 87: 1215-1224 (1996); 
Nakao et al., EMBO J. 16: 5353-5362 (1997). Affinity cross-linking, 
using 125I-TGF-β1, and immunoprecipitation with Flag antibody were 
carried out, as discussed supra. The expression of ALK-1 was 
determined using anti-HA antibody, since the vector used to express 
ALK-1 effectively tagged it with HA. 

Immunoprecipitation of Smad1 resulted in coprecipitation 
of a cross-linked TβR-II/ALK-1 complex, suggesting a direct 
association of Smad1 with ALK-1. 

These examples show that one can identify molecules which 
inhibit, or enhance, expression of a gene whose expression is 
regulated by phosphorylated Smad1. To elaborate, as ALK-1 has been 
identified as a key constituent of the pathway by which Smad1 is 
phosphorylated, one can contact cells which express both Smad1 and 
ALK-1 with a substance of interest, and then determine if the Smad1 
becomes phosphorylated. The cells can be those which inherently 
express both ALK-1 and Smad1, or which have been transformed or 
transfected with DNA encoding one or both of these. One can 
determine the phosphorylation via, e.g., the use of 
anti-phosphorylated serine antibodies, as discussed supra. In an 
especially preferred embodiment, the assay can be carried out using 
TGF-β as a competing agent. The TGF-β, as has been shown, does 
bind to ALK-1, leading to phosphorylation of Smad1. Hence, by 
determining a value with TGF-β alone, one can then compare a value 
determined with amounts of the substance to be tested, in the 
presence of TGF-β. Changes in phosphorylation levels can thus be 
attributed to the test substance. 
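
The comparison described above can be sketched numerically. The following Python fragment is a minimal illustration only. It assumes that Smad1 phosphorylation has been quantified as a single signal value per sample (for example, a band intensity). The function names, the threshold of 1.5 and the example values are hypothetical and are not taken from the experiments reported herein.

```python
def relative_change(tgf_beta_alone, with_test_substance):
    """Ratio of the signal measured with the test substance (in the presence
    of TGF-beta) to the signal measured with TGF-beta alone."""
    if tgf_beta_alone <= 0:
        raise ValueError("baseline signal must be positive")
    return with_test_substance / tgf_beta_alone


def classify(tgf_beta_alone, with_test_substance, threshold=1.5):
    """Label a test substance by its effect on the phosphorylation signal.

    The threshold is an arbitrary, assumed cut-off: a ratio at or above it is
    called an enhancer, a ratio at or below its reciprocal an inhibitor.
    """
    ratio = relative_change(tgf_beta_alone, with_test_substance)
    if ratio >= threshold:
        return "enhancer"
    if ratio <= 1.0 / threshold:
        return "inhibitor"
    return "no effect"


if __name__ == "__main__":
    # Hypothetical signal values in arbitrary units.
    print(classify(100.0, 40.0))
    print(classify(100.0, 200.0))
```

In practice the cut-off would be chosen from replicate measurements rather than fixed in advance.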

In this type of system, it must be kept in mind that both type 
I receptors and type II receptors must be present; however, as 
indicated supra, one can eliminate the requirement for a type II 
receptor by utilizing a constitutively active form of ALK-1, such 
as the form described supra. Additional approaches to inhibiting 
this system will be clear to the skilled artisan. For example, 
since it is known that there is interaction between Smad1 and the 
ALK-1 receptor, one can test for inhibition via the use of small 
molecules which inhibit the receptor/Smad interaction. Heldin et 
al., supra, mention Smad6 and Smad7 as Smad1 inhibitors, albeit in 
the context of a different system. Hence one can test for 
inhibition, or inhibit the interaction, by adding a molecule to be 
tested, or a molecule used for actual inhibition, to a cell, wherein 
the molecule is internalized by the cell, followed by assaying for 
phosphorylation via a method such as is discussed supra. 

In a similar way, one can assay for inhibitors of type I/type 
II receptor interaction, by testing the molecule of interest in a 
system which includes both receptors, and then assaying for 
phosphorylation. 

Conversely, activators or agonists can also be tested for, or 
utilized, following the same type of procedures. 

Using any of these systems, one can identify any gene or 
genes which are activated by phosphorylated Smad1. To elaborate, 
the art is very familiar with systems of expression analysis, such 
as differential display PCR, subtraction hybridization, and other 
systems which combine driver and tester populations of nucleic 
acids, whereby transcripts which are expressed or not expressed can 
be identified. By simply using an activator/inhibitor of the 
system disclosed herein on a first sample, and a second sample 
where none is used, one can then carry out analysis of transcripts, 
thereby determining the transcripts of interest. 
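
The two-sample comparison just described amounts to comparing two sets of detected transcripts. The Python fragment below is a minimal sketch under that simplifying assumption (presence or absence only, with no quantitation). The function name and the transcript labels are hypothetical placeholders.

```python
def compare_transcripts(treated, untreated):
    """Compare transcripts detected in a treated sample with those detected
    in an untreated sample, using presence or absence only."""
    treated_set = set(treated)
    untreated_set = set(untreated)
    return {
        # detected only after treatment with the activator/inhibitor
        "only_in_treated": sorted(treated_set - untreated_set),
        # detected only in the sample where none was used
        "only_in_untreated": sorted(untreated_set - treated_set),
        # detected in both samples
        "in_both": sorted(treated_set & untreated_set),
    }


if __name__ == "__main__":
    # Placeholder labels, not real gene names.
    result = compare_transcripts(
        treated=["transcript_a", "transcript_b", "transcript_c"],
        untreated=["transcript_b", "transcript_d"],
    )
    for category, names in result.items():
        print(category, names)
```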

Also a part of the invention is the regulation of 
phosphorylation of Smad1, with inhibitors, such as antibodies 
against the extracellular domain of ALK-1 or TGF-β, or enhancers, 
such as TGF-β itself, or those portions of the TGF-β molecule which 
are necessary for binding. Indeed, by appropriate truncation, one 
can also determine what portions of ALK-1 are required for 
phosphorylation of Smad1 to take place. 

The invention has been described by way of example only, 
without restriction of its scope. The invention is defined by the 
subject matter herein, including the claims that follow the 
immediately following full Sequence Listings. 

(i) APPLICANTS: Miyazono, Kohei; Imamura, Takeshi; ten Dijke, Peter 

(ii) TITLE OF INVENTION: Isolated ALK-1 Protein, Nucleic Acids Encoding 
It, And Uses Thereof 

(A) ADDRESSEE: Felfe & Lynch 

(vii) PRIOR APPLICATION DATA: 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1984 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 283..1791 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

AGGAAACGGT TTATTAGGAG GGAGTGGTGG AGCTGGGCCA GGCAGGAAGA CGCTGGAATA 60 

AGAAACATTT TTGCTCCAGC CCCCATCCCA GTCCCGGGAG GCTGCCGCGC CAGCTGCGCC 120 

GAGCGAGCCC CTCCCCGGCT CCAGCCCGGT CCGGGGCCGC GCCGGACCCC AGCCCGCCGT 180 

CCAGCGCTGG CGGTGCAACT GCGGCCGCGC GGTGGAGGGG AGGTGGCCCC GGTCCGCCGA 240 

AGGCTAGCGC CCCGCCACCC GCAGAGCGGG CCCAGAGGGA CC ATG ACC TTG GGC 294 

Met Thr Leu Gly 
1 

TCC CCC AGG AAA GGC CTT CTG ATG CTG CTG ATG GCC TTG GTG ACC CAG 342 
Ser Pro Arg Lys Gly Leu Leu Met Leu Leu Met Ala Leu Val Thr Gln 
5 10 15 20 

GGA GAC CCT GTG AAG CCG TCT CGG GGC CCG CTG GTG ACC TGC ACG TGT 390 
Gly Asp Pro Val Lys Pro Ser Arg Gly Pro Leu Val Thr Cys Thr Cys 
25 30 35 

GAG AGC CCA CAT TGC AAG GGG CCT ACC TGC CGG GGG GCC TGG TGC ACA 438 
Glu Ser Pro His Cys Lys Gly Pro Thr Cys Arg Gly Ala Trp Cys Thr 
40 45 50 

GTA GTG CTG GTG CGG GAG GAG GGG AGG CAC CCC CAG GAA CAT CGG GGC 486 
Val Val Leu Val Arg Glu Glu Gly Arg His Pro Gln Glu His Arg Gly 
55 60 65 

TGC GGG AAC TTG CAC AGG GAG CTC TGC AGG GGG CGC CCC ACC GAG TTC 534 
Cys Gly Asn Leu His Arg Glu Leu Cys Arg Gly Arg Pro Thr Glu Phe 
70 75 80 

GTC AAC CAC TAC TGC TGC GAC AGC CAC CTC TGC AAC CAC AAC GTG TCC 582 
Val Asn His Tyr Cys Cys Asp Ser His Leu Cys Asn His Asn Val Ser 
85 90 95 100 

CTG GTG CTG GAG GCC ACC CAA CCT CCT TCG GAG CAG CCG GGA ACA GAT 630 
Leu Val Leu Glu Ala Thr Gln Pro Pro Ser Glu Gln Pro Gly Thr Asp 
105 110 115 

GGC CAG CTG GCC CTG ATC CTG GGC CCC GTG CTG GCC TTG CTG GCC CTG 678 
Gly Gln Leu Ala Leu Ile Leu Gly Pro Val Leu Ala Leu Leu Ala Leu 
120 125 130 

GTG GCC CTG GGT GTC CTG GGC CTG TGG CAT GTC CGA CGG AGG CAG GAG 726 
Val Ala Leu Gly Val Leu Gly Leu Trp His Val Arg Arg Arg Gln Glu 
135 140 145 

AAG CAG CGT GGC CTG CAC AGC GAG CTG GGA GAG TCC AGT CTC ATC CTG 774 
Lys Gln Arg Gly Leu His Ser Glu Leu Gly Glu Ser Ser Leu Ile Leu 
150 155 160 

AAA GCA TCT GAG CAG GGC GAC ACG ATG TTG GGG GAC CTC CTG GAC AGT 822 
Lys Ala Ser Glu Gln Gly Asp Thr Met Leu Gly Asp Leu Leu Asp Ser 
165 170 175 180 

GAC TGC ACC ACA GGG AGT GGC TCA GGG CTC CCC TTC CTG GTG CAG AGG 870 
Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu Val Gln Arg 
185 190 195 

ACA GTG GCA CGG CAG GTT GCC TTG GTG GAG TGT GTG GGA AAA GGC CGC 918 
Thr Val Ala Arg Gln Val Ala Leu Val Glu Cys Val Gly Lys Gly Arg 
200 205 210 

TAT GGC GAA GTG TGG CGG GGC TTG TGG CAC GGT GAG AGT GTG GCC GTC 966 
Tyr Gly Glu Val Trp Arg Gly Leu Trp His Gly Glu Ser Val Ala Val 
215 220 225 

AAG ATC TTC TCC TCG AGG GAT GAA CAG TCC TGG TTC CGG GAG ACT GAG 1014 
Lys Ile Phe Ser Ser Arg Asp Glu Gln Ser Trp Phe Arg Glu Thr Glu 
230 235 240 

ATC TAT AAC ACA GTA TTG CTC AGA CAC GAC AAC ATC CTA GGC TTC ATC 1062 
Ile Tyr Asn Thr Val Leu Leu Arg His Asp Asn Ile Leu Gly Phe Ile 
245 250 255 260 

GCC TCA GAC ATG ACC TCC CGC AAC TCG AGC ACG CAG CTG TGG CTC ATC 1110 
Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gln Leu Trp Leu Ile 
265 270 275 

ACG CAC TAC CAC GAG CAC GGC TCC CTC TAC GAC TTT CTG CAG AGA CAG 1158 
Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe Leu Gln Arg Gln 
280 285 290 

ACG CTG GAG CCC CAT CTG GCT CTG AGG CTA GCT GTG TCC GCG GCA TGC 1206 
Thr Leu Glu Pro His Leu Ala Leu Arg Leu Ala Val Ser Ala Ala Cys 
295 300 305 

GGC CTG GCG CAC CTG CAC GTG GAG ATC TTC GGT ACA CAG GGC AAA CCA 1254 
Gly Leu Ala His Leu His Val Glu Ile Phe Gly Thr Gln Gly Lys Pro 
310 315 320 

GCC ATT GCC CAC CGC GAC TTC AAG AGC CGC AAT GTG CTG GTC AAG AGC 1302 
Ala Ile Ala His Arg Asp Phe Lys Ser Arg Asn Val Leu Val Lys Ser 
325 330 335 340 

AAC CTG CAG TGT TGC ATC GCC GAC CTG GGC CTG GCT GTG ATG CAC TCA 1350 
Asn Leu Gln Cys Cys Ile Ala Asp Leu Gly Leu Ala Val Met His Ser 
345 350 355 

CAG GGC AGC GAT TAC CTG GAC ATC GGC AAC AAC CCG AGA GTG GGC ACC 1398 
Gln Gly Ser Asp Tyr Leu Asp Ile Gly Asn Asn Pro Arg Val Gly Thr 
360 365 370 

AAG CGG TAC ATG GCA CCC GAG GTG CTG GAC GAG CAG ATC CGC ACG GAC 1446 
Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Gln Ile Arg Thr Asp 
375 380 385 

TGC TTT GAG TCC TAC AAG TGG ACT GAC ATC TGG GCC TTT GGC CTG GTG 1494 
Cys Phe Glu Ser Tyr Lys Trp Thr Asp Ile Trp Ala Phe Gly Leu Val 
390 395 400 

CTG TGG GAG ATT GCC CGC CGG ACC ATC GTG AAT GGC ATC GTG GAG GAC 1542 
Leu Trp Glu Ile Ala Arg Arg Thr Ile Val Asn Gly Ile Val Glu Asp 
405 410 415 420 

TAT AGA CCA CCC TTC TAT GAT GTG GTG CCC AAT GAC CCC AGC TTT GAG 1590 
Tyr Arg Pro Pro Phe Tyr Asp Val Val Pro Asn Asp Pro Ser Phe Glu 
425 430 435 

GAC ATG AAG AAG GTG GTG TGT GTG GAT CAG CAG ACC CCC ACC ATC CCT 1638 
Asp Met Lys Lys Val Val Cys Val Asp Gln Gln Thr Pro Thr Ile Pro 
440 445 450 

AAC CGG CTG GCT GCA GAC CCG GTC CTC TCA GGC CTA GCT CAG ATG ATG 1686 
Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu Ala Gln Met Met 
455 460 465 

CGG GAG TGC TGG TAC CCA AAC CCC TCT GCC CGA CTC ACC GCG CTG CGG 1734 
Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu Thr Ala Leu Arg 
470 475 480 

ATC AAG AAG ACA CTA CAA AAA ATT AGC AAC AGT CCA GAG AAG CCT AAA 1782 
Ile Lys Lys Thr Leu Gln Lys Ile Ser Asn Ser Pro Glu Lys Pro Lys 
485 490 495 500 

GTG ATT CAA TAGCCCAGGA GCACCTGATT CCTTTCTGCC TGCAGGGGGC 1831 
Val Ile Gln 

TGGGGGGGTG GGGGGCAGTG GATGGTGCCC TATCTGGGTA GAGGTAGTGT GAGTGTGGTG 1891 

TGTGCTGGGG ATGGGCAGCT GCGCCTGCCT GCTCGGCCCC CAGCCCACCC AGCCAAAAAT 1951 

ACAGCTGGGC TGAAACCTGA AAAAAAAAAA AAA 1984 
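
As a consistency check on the feature table, the CDS location can be compared with the stated length of the encoded protein. The short Python sketch below is illustrative only: the function name is hypothetical, and it assumes that the stated CDS range is inclusive at both ends and does not include the stop codon.

```python
def cds_codons(start, end):
    """Number of codons in a CDS given inclusive 1-based start and end positions."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


if __name__ == "__main__":
    # SEQ ID NO: 1, CDS 283..1791; SEQ ID NO: 2 is stated as 503 amino acids.
    print(cds_codons(283, 1791))
```

Under these assumptions, the same check applied to SEQ ID NO: 3 (CDS 104..1630) gives 509 codons, in agreement with the 509 amino acids stated for SEQ ID NO: 4.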



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Leu Gly Ser Pro Arg Lys Gly Leu Leu Met Leu Leu Met Ala 
1 5 10 15 

Leu Val Thr Gln Gly Asp Pro Val Lys Pro Ser Arg Gly Pro Leu Val 
20 25 30 

Thr Cys Thr Cys Glu Ser Pro His Cys Lys Gly Pro Thr Cys Arg Gly 
35 40 45 

Ala Trp Cys Thr Val Val Leu Val Arg Glu Glu Gly Arg His Pro Gln 
50 55 60 

Glu His Arg Gly Cys Gly Asn Leu His Arg Glu Leu Cys Arg Gly Arg 
65 70 75 80 

Pro Thr Glu Phe Val Asn His Tyr Cys Cys Asp Ser His Leu Cys Asn 
85 90 95 

His Asn Val Ser Leu Val Leu Glu Ala Thr Gln Pro Pro Ser Glu Gln 
100 105 110 

Pro Gly Thr Asp Gly Gln Leu Ala Leu Ile Leu Gly Pro Val Leu Ala 
115 120 125 

Leu Leu Ala Leu Val Ala Leu Gly Val Leu Gly Leu Trp His Val Arg 
130 135 140 

Arg Arg Gln Glu Lys Gln Arg Gly Leu His Ser Glu Leu Gly Glu Ser 
145 150 155 160 

Ser Leu Ile Leu Lys Ala Ser Glu Gln Gly Asp Thr Met Leu Gly Asp 
165 170 175 

Leu Leu Asp Ser Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe 
180 185 190 

Leu Val Gln Arg Thr Val Ala Arg Gln Val Ala Leu Val Glu Cys Val 
195 200 205 

Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Leu Trp His Gly Glu 
210 215 220 

Ser Val Ala Val Lys Ile Phe Ser Ser Arg Asp Glu Gln Ser Trp Phe 
225 230 235 240 

Arg Glu Thr Glu Ile Tyr Asn Thr Val Leu Leu Arg His Asp Asn Ile 
245 250 255 

Leu Gly Phe Ile Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gln 
260 265 270 

Leu Trp Leu Ile Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe 
275 280 285 

Leu Gln Arg Gln Thr Leu Glu Pro His Leu Ala Leu Arg Leu Ala Val 
290 295 300 

Ser Ala Ala Cys Gly Leu Ala His Leu His Val Glu Ile Phe Gly Thr 
305 310 315 320 

Gln Gly Lys Pro Ala Ile Ala His Arg Asp Phe Lys Ser Arg Asn Val 

325 330 335 

Leu Val Lys Ser Asn Leu Gln Cys Cys Ile Ala Asp Leu Gly Leu Ala 
340 345 350 

Val Met His Ser Gln Gly Ser Asp Tyr Leu Asp Ile Gly Asn Asn Pro 
355 360 365 

Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Gln 
370 375 380 

Ile Arg Thr Asp Cys Phe Glu Ser Tyr Lys Trp Thr Asp Ile Trp Ala 
385 390 395 400 

Phe Gly Leu Val Leu Trp Glu Ile Ala Arg Arg Thr Ile Val Asn Gly 
405 410 415 

Ile Val Glu Asp Tyr Arg Pro Pro Phe Tyr Asp Val Val Pro Asn Asp 
420 425 430 

Pro Ser Phe Glu Asp Met Lys Lys Val Val Cys Val Asp Gln Gln Thr 
435 440 445 

Pro Thr Ile Pro Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu 
450 455 460 

Ala Gln Met Met Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu 
465 470 475 480 

Thr Ala Leu Arg Ile Lys Lys Thr Leu Gln Lys Ile Ser Asn Ser Pro 
485 490 495 

Glu Lys Pro Lys Val Ile Gln 
500 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(v) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 104.. 1630 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTCCGAGTAC CCCAGTGACC AGAGTGAGAG AAGCTCTGAA CGAGGGCACG CGGCTTGAAG 60 

GACTGTGGGC AGATGTGACC AAGAGCCTGC ATTAAGTTGT ACA ATG GTA GAT GGA 115 
Met Val Asp Gly 
1 

GTG ATG ATT CTT CCT GTG CTT ATC ATG ATT GCT CTC CCC TCC CCT AGT 163 
Val Met Ile Leu Pro Val Leu Ile Met Ile Ala Leu Pro Ser Pro Ser 
5 10 15 20 

ATG GAA GAT GAG AAG CCC AAG GTC AAC CCC AAA CTC TAC ATG TGT GTG 211 
Met Glu Asp Glu Lys Pro Lys Val Asn Pro Lys Leu Tyr Met Cys Val 
25 30 35 

TGT GAA GGT CTC TCC TGC GGT AAT GAG GAC CAC TGT GAA GGC CAG CAG 259 
Cys Glu Gly Leu Ser Cys Gly Asn Glu Asp His Cys Glu Gly Gln Gln 
40 45 50 

TGC TTT TCC TCA CTG AGC ATC AAC GAT GGC TTC CAC GTC TAC CAG AAA 307 
Cys Phe Ser Ser Leu Ser Ile Asn Asp Gly Phe His Val Tyr Gln Lys 
55 60 65 

GGC TGC TTC CAG GTT TAT GAG CAG GGA AAG ATG ACC TGT AAG ACC CCG 355 
Gly Cys Phe Gln Val Tyr Glu Gln Gly Lys Met Thr Cys Lys Thr Pro 
70 75 80 

CCG TCC CCT GGC CAA GCT GTG GAG TGC TGC CAA GGG GAC TGG TGT AAC 403 
Pro Ser Pro Gly Gln Ala Val Glu Cys Cys Gln Gly Asp Trp Cys Asn 
85 90 95 100 

AGG AAC ATC ACG GCC CAG CTG CCC ACT AAA GGA AAA TCC TTC CCT GGA 451 
Arg Asn Ile Thr Ala Gln Leu Pro Thr Lys Gly Lys Ser Phe Pro Gly 

105 110 115 

ACA CAG AAT TTC CAC TTG GAG GTT GGC CTC ATT ATT CTC TCT GTA GTG 499 
Thr Gln Asn Phe His Leu Glu Val Gly Leu Ile Ile Leu Ser Val Val 
120 125 130 

TTC GCA GTA TGT CTT TTA GCC TGC CTG CTG GGA GTT GCT CTC CGA AAA 547 
Phe Ala Val Cys Leu Leu Ala Cys Leu Leu Gly Val Ala Leu Arg Lys 
135 140 145 

TTT AAA AGG CGC AAC CAA GAA CGC CTC AAT CCC CGA GAC GTG GAG TAT 595 
Phe Lys Arg Arg Asn Gln Glu Arg Leu Asn Pro Arg Asp Val Glu Tyr 
150 155 160 

GGC ACT ATC GAA GGG CTC ATC ACC ACC AAT GTT GGA GAC AGC ACT TTA 643 
Gly Thr Ile Glu Gly Leu Ile Thr Thr Asn Val Gly Asp Ser Thr Leu 
165 170 175 180 

GCA GAT TTA TTG GAT CAT TCG TGT ACA TCA GGA AGT GGC TCT GGT CTT 691 
Ala Asp Leu Leu Asp His Ser Cys Thr Ser Gly Ser Gly Ser Gly Leu 
185 190 195 

CCT TTT CTG GTA CAA AGA ACA GTG GCT CGC CAG ATT ACA CTG TTG GAG 739 
Pro Phe Leu Val Gln Arg Thr Val Ala Arg Gln Ile Thr Leu Leu Glu 
200 205 210 

TGT GTC GGG AAA GGC AGG TAT GGT GAG GTG TGG AGG GGC AGC TGG CAA 787 
Cys Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Ser Trp Gln 
215 220 225 

GGG GAA AAT GTT GCC GTG AAG ATC TTC TCC TCC CGT GAT GAG AAG TCA 835 
Gly Glu Asn Val Ala Val Lys Ile Phe Ser Ser Arg Asp Glu Lys Ser 
230 235 240 

TGG TTC AGG GAA ACG GAA TTG TAC AAC ACT GTG ATG CTG AGG CAT GAA 883 
Trp Phe Arg Glu Thr Glu Leu Tyr Asn Thr Val Met Leu Arg His Glu 
245 250 255 260 

AAT ATC TTA GGT TTC ATT GCT TCA GAC ATG ACA TCA AGA CAC TCC AGT 931 
Asn Ile Leu Gly Phe Ile Ala Ser Asp Met Thr Ser Arg His Ser Ser 
265 270 275 

ACC CAG CTG TGG TTA ATT ACA CAT TAT CAT GAA ATG GGA TCG TTG TAC 979 
Thr Gln Leu Trp Leu Ile Thr His Tyr His Glu Met Gly Ser Leu Tyr 
280 285 290 

GAC TAT CTT CAG CTT ACT ACT CTG GAT ACA GTT AGC TGC CTT CGA ATA 1027 
Asp Tyr Leu Gln Leu Thr Thr Leu Asp Thr Val Ser Cys Leu Arg Ile 
295 300 305 

GTG CTG TCC ATA GCT AGT GGT CTT GCA CAT TTG CAC ATA GAG ATA TTT 1075 
Val Leu Ser Ile Ala Ser Gly Leu Ala His Leu His Ile Glu Ile Phe 
310 315 320 

GGG ACC CAA GGG AAA CCA GCC ATT GCC CAT CGA GAT TTA AAG AGC AAA 1123 
Gly Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys 
325 330 335 340 

AAT ATT CTG GTT AAG AAG AAT GGA CAG TGT TGC ATA GCA GAT TTG GGC 1171 
Asn Ile Leu Val Lys Lys Asn Gly Gln Cys Cys Ile Ala Asp Leu Gly 
345 350 355 

CTG GCA GTC ATG CAT TCC CAG AGC ACC AAT CAG CTT GAT GTG GGG AAC 1219 
Leu Ala Val Met His Ser Gln Ser Thr Asn Gln Leu Asp Val Gly Asn 
360 365 370 

AAT CCC CGT GTG GGC ACC AAG CGC TAC ATG GCC CCC GAA GTT CTA GAT 1267 
Asn Pro Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp 
375 380 385 

GAA ACC ATC CAG GTG GAT TGT TTC GAT TCT TAT AAA AGG GTC GAT ATT 1315 
Glu Thr Ile Gln Val Asp Cys Phe Asp Ser Tyr Lys Arg Val Asp Ile 
390 395 400 

TGG GCC TTT GGA CTT GTT TTG TGG GAA GTG GCC AGG CGG ATG GTG AGC 1363 
Trp Ala Phe Gly Leu Val Leu Trp Glu Val Ala Arg Arg Met Val Ser 
405 410 415 420 

AAT GGT ATA GTG GAG GAT TAC AAG CCA CCG TTC TAC GAT GTG GTT CCC 1411 
Asn Gly Ile Val Glu Asp Tyr Lys Pro Pro Phe Tyr Asp Val Val Pro 
425 430 435 

AAT GAC CCA AGT TTT GAA GAT ATG AGG AAG GTA GTC TGT GTG GAT CAA 1459 
Asn Asp Pro Ser Phe Glu Asp Met Arg Lys Val Val Cys Val Asp Gln 
440 445 450 

CAA AGG CCA AAC ATA CCC AAC AGA TGG TTC TCA GAC CCG ACA TTA ACC 1507 
Gln Arg Pro Asn Ile Pro Asn Arg Trp Phe Ser Asp Pro Thr Leu Thr 
455 460 465 

TCT CTG GCC AAG CTA ATG AAA GAA TGC TGG TAT CAA AAT CCA TCC GCA 1555 
Ser Leu Ala Lys Leu Met Lys Glu Cys Trp Tyr Gln Asn Pro Ser Ala 
470 475 480 

AGA CTC ACA GCA CTG CGT ATC AAA AAG ACT TTG ACC AAA ATT GAT AAT 1603 
Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Thr Lys Ile Asp Asn 
485 490 495 500 

TCC CTC GAC AAA TTG AAA ACT GAC TGT TGACATTTTC ATAGTGTCAA 1650 
Ser Leu Asp Lys Leu Lys Thr Asp Cys 
505 

GAAGGAAGAT TTGACGTTGT TGTCATTGTC CAGCTGGGAC CTAATGCTGG CCTGACTGGT 1710 



TGTCAGAATG GAATCCATCT GTCTCCCTCC CCAAATGGCT GCTTTGACAA GGCAGACGTC 1770 

GTACCCAGCC ATGTGTTGGG GAGACATCAA AACCACCCTA ACCTCGCTCG ATGACTGTGA 1830 

ACTGGGCATT TCACGAACTG TTCACACTGC AGAGACTAAT GTTGGACAGA CACTGTTGCA 1890 

AAGGTAGGGA CTGGAGGAAC ACAGAGAAAT CCTAAAAGAG ATCTGGGCAT TAAGTCAGTG 1950 

GCTTTGCATA GCTTTCACAA GTCTCCTAGA CACTCCCCAC GGGAAACTCA AGGAGGTGGT 2010 

GAATTTTTAA TCAGCAATAT TGCCTGTGCT TCTCTTCTTT ATTGCACTAG GAATTCTTTG 2070 

CATTCCTTAC TTGCACTGTT ACTCTTAATT TTAAAGACCC AACTTGCCAA AATGTTGGCT 2130 

GCGTACTCCA CTGGTCTGTC TTTGGATAAT AGGAATTCAA TTTGGCAAAA CAAAATGTAA 2190 

TGTCAGACTT TGCTGCATTT TACACATGTG CTGATGTTTA CAATGATGCC GAACATTAGG 2250 

AATTGTTTAT ACACAACTTT GCAAATTATT TATTACTTGT GCACTTAGTA GTTTTTACAA 2310 

AACTGCTTTG TGCATATGTT AAAGCTTATT TTTATGTGGT CTTATGATTT TATTACAGAA 2370 

ATGTTTTTAA CACTATACTC TAAAATGGAC ATTTTCTTTT ATTATCAGTT AAAATCACAT 2430 

TTTAAGTGCT TCACATTTGT ATGTGTGTAG ACTGTAACTT TTTTTCAGTT CATATGCAGA 2490 

ACGTATTTAG CCATTACCCA CGTGACACCA CCGAATATAT TATCGATTTA GAAGCAAAGA 2550 

TTTCAGTAGA ATTTTAGTCC TGAACGCTAC GGGGAAAATG CATTTTCTTC AGAATTATCC 2610 

ATTACGTGCA TTTAAACTCT GCCAGAAAAA AATAACTATT TTGTTTTAAT CTACTTTTTG 2670 

TATTTAGTAG TTATTTGTAT AAATTAAATA AACTGTTTTC AAGTCAAAAA AAAA 2724 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 509 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Val Asp Gly Val Met Ile Leu Pro Val Leu Ile Met Ile Ala Leu 
1 5 10 15 

Pro Ser Pro Ser Met Glu Asp Glu Lys Pro Lys Val Asn Pro Lys Leu 
20 25 30 

Tyr Met Cys Val Cys Glu Gly Leu Ser Cys Gly Asn Glu Asp His Cys 
35 40 45 

Glu Gly Gln Gln Cys Phe Ser Ser Leu Ser Ile Asn Asp Gly Phe His 
50 55 60 

Val Tyr Gln Lys Gly Cys Phe Gln Val Tyr Glu Gln Gly Lys Met Thr 
65 70 75 80 

Cys Lys Thr Pro Pro Ser Pro Gly Gln Ala Val Glu Cys Cys Gln Gly 
85 90 95 

Asp Trp Cys Asn Arg Asn Ile Thr Ala Gln Leu Pro Thr Lys Gly Lys 
100 105 110 

Ser Phe Pro Gly Thr Gln Asn Phe His Leu Glu Val Gly Leu Ile Ile 
115 120 125 

Leu Ser Val Val Phe Ala Val Cys Leu Leu Ala Cys Leu Leu Gly Val 
130 135 140 

Ala Leu Arg Lys Phe Lys Arg Arg Asn Gln Glu Arg Leu Asn Pro Arg 
145 150 155 160 

Asp Val Glu Tyr Gly Thr Ile Glu Gly Leu Ile Thr Thr Asn Val Gly 
165 170 175 

Asp Ser Thr Leu Ala Asp Leu Leu Asp His Ser Cys Thr Ser Gly Ser 
180 185 190 

Gly Ser Gly Leu Pro Phe Leu Val Gln Arg Thr Val Ala Arg Gln Ile 
195 200 205 

Thr Leu Leu Glu Cys Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg 
210 215 220 

Gly Ser Trp Gln Gly Glu Asn Val Ala Val Lys Ile Phe Ser Ser Arg 
225 230 235 240 

Asp Glu Lys Ser Trp Phe Arg Glu Thr Glu Leu Tyr Asn Thr Val Met 
245 250 255 

Leu Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ser Asp Met Thr Ser 
260 265 270 

Arg His Ser Ser Thr Gln Leu Trp Leu Ile Thr His Tyr His Glu Met 
275 280 285 

Gly Ser Leu Tyr Asp Tyr Leu Gln Leu Thr Thr Leu Asp Thr Val Ser 
290 295 300 

Cys Leu Arg Ile Val Leu Ser Ile Ala Ser Gly Leu Ala His Leu His 
305 310 315 320 

Ile Glu Ile Phe Gly Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp 
325 330 335 

Leu Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Gln Cys Cys Ile 
340 345 350 

Ala Asp Leu Gly Leu Ala Val Met His Ser Gln Ser Thr Asn Gln Leu 
355 360 365 

Asp Val Gly Asn Asn Pro Arg Val Gly Thr Lys Arg Tyr Met Ala Pro 
370 375 380 

Glu Val Leu Asp Glu Thr Ile Gln Val Asp Cys Phe Asp Ser Tyr Lys 
385 390 395 400 

Arg Val Asp Ile Trp Ala Phe Gly Leu Val Leu Trp Glu Val Ala Arg 
405 410 415 

Arg Met Val Ser Asn Gly Ile Val Glu Asp Tyr Lys Pro Pro Phe Tyr 
420 425 430 

Asp Val Val Pro Asn Asp Pro Ser Phe Glu Asp Met Arg Lys Val Val 
435 440 445 

Cys Val Asp Gln Gln Arg Pro Asn Ile Pro Asn Arg Trp Phe Ser Asp 
450 455 460 

Pro Thr Leu Thr Ser Leu Ala Lys Leu Met Lys Glu Cys Trp Tyr Gln 
465 470 475 480 

Asn Pro Ser Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Thr 
485 490 495 

Lys Ile Asp Asn Ser Leu Asp Lys Leu Lys Thr Asp Cys 
500 505 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2932 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 310.. 1905 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCTCCGCGCC GAGGGCTGGA GGATGCGTTC CCTGGGGTCC GGACTTATGA AAATATGCAT 60 

CAGTTTAATA CTGTCTTGGA ATTCATGAGA TGGAAGCATA GGTCAAAGCT GTTTGGAGAA 120 

AATCAGAAGT ACAGTTTTAT CTAGCCACAT CTTGGAGGAG TCGTAAGAAA GCAGTGGGAG 180 

TTGAAGTCAT TGTCAAGTGC TTGCGATCTT TTACAAGAAA ATCTCACTGA ATGATAGTCA 240 

TTTAAATTGG TGAAGTAGCA AGACCAATTA TTAAAGGTGA CAGTACACAG GAAACATTAC 300 

AATTGAACA ATG ACT CAG CTA TAC ATT TAC ATC AGA TTA TTG GGA GCC 348 
Met Thr Gln Leu Tyr Ile Tyr Ile Arg Leu Leu Gly Ala 
1 5 10 

TAT TTG TTC ATC ATT TCT CGT GTT CAA GGA CAG AAT CTG GAT AGT ATG 396 
Tyr Leu Phe Ile Ile Ser Arg Val Gln Gly Gln Asn Leu Asp Ser Met 
15 20 25 

CTT CAT GGC ACT GGG ATG AAA TCA GAC TCC GAC CAG AAA AAG TCA GAA 444 
Leu His Gly Thr Gly Met Lys Ser Asp Ser Asp Gln Lys Lys Ser Glu 
30 35 40 45 

AAT GGA GTA ACC TTA GCA CCA GAG GAT ACC TTG CCT TTT TTA AAG TGC 492 
Asn Gly Val Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys 
50 55 60 

TAT TGC TCA GGG CAC TGT CCA GAT GAT GCT ATT AAT AAC ACA TGC ATA 540 
Tyr Cys Ser Gly His Cys Pro Asp Asp Ala Ile Asn Asn Thr Cys Ile 
65 70 75 

ACT AAT GGA CAT TGC TTT GCC ATC ATA GAA GAA GAT GAC CAG GGA GAA 588 
Thr Asn Gly His Cys Phe Ala Ile Ile Glu Glu Asp Asp Gln Gly Glu 
80 85 90 
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ACC ACA TTA GCT TCA GGG TGT ATG AAA TAT GAA GGA TCT GAT TTT CAG 636 

Thr Thr Leu Ala Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gin 
95 100 105 

TGC AAA GAT TCT CCA AAA GCC CAG CTA CGC CGG ACA ATA GAA TGT TGT 684
Cys Lys Asp Ser Pro Lys Ala Gln Leu Arg Arg Thr Ile Glu Cys Cys
110 115 120 125

CGG ACC AAT TTA TGT AAC CAG TAT TTG CAA CCC ACA CTG CCC CCT GTT 732
Arg Thr Asn Leu Cys Asn Gln Tyr Leu Gln Pro Thr Leu Pro Pro Val
130 135 140

GTC ATA GGT CCG TTT TTT GAT GGC AGC ATT CGA TGG CTG GTT TTG CTC 780
Val Ile Gly Pro Phe Phe Asp Gly Ser Ile Arg Trp Leu Val Leu Leu
145 150 155

ATT TCT ATG GCT GTC TGC ATA ATT GCT ATG ATC ATC TTC TCC AGC TGC 828
Ile Ser Met Ala Val Cys Ile Ile Ala Met Ile Ile Phe Ser Ser Cys
160 165 170

TTT TGT TAC AAA CAT TAT TGC AAG AGC ATC TCA AGC AGA CGT CGT TAC 876
Phe Cys Tyr Lys His Tyr Cys Lys Ser Ile Ser Ser Arg Arg Arg Tyr
175 180 185

AAT CGT GAT TTG GAA CAG GAT GAA GCA TTT ATT CCA GTT GGA GAA TCA 924
Asn Arg Asp Leu Glu Gln Asp Glu Ala Phe Ile Pro Val Gly Glu Ser
190 195 200 205

CTA AAA GAC CTT ATT GAC CAG TCA CAA AGT TCT GGT AGT GGG TCT GGA 972
Leu Lys Asp Leu Ile Asp Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly
210 215 220

CTA CCT TTA TTG GTT CAG CGA ACT ATT GCC AAA CAG ATT CAG ATG GTC 1020
Leu Pro Leu Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val
225 230 235

CGG CAA GTT GGT AAA GGC CGA TAT GGA GAA GTA TGG ATG GGC AAA TGG 1068
Arg Gln Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp
240 245 250

CGT GGC GAA AAA GTG GCG GTG AAA GTA TTC TTT ACC ACT GAA GAA GCC 1116
Arg Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala
255 260 265

AGC TGG TTT CGA GAA ACA GAA ATC TAC CAA ACT GTG CTA ATG CGC CAT 1164
Ser Trp Phe Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg His
270 275 280 285

GAA AAC ATA CTT GGT TTC ATA GCG GCA GAC ATT AAA GGT ACA GGT TCC 1212
Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly Ser

290 295 300

TGG ACT CAG CTC TAT TTG ATT ACT GAT TAC CAT GAA AAT GGA TCT CTC 1260
Trp Thr Gln Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser Leu
305 310 315

TAT GAC TTC CTG AAA TGT GCT ACA CTG GAC ACC AGA GCC CTG CTT AAA 1308
Tyr Asp Phe Leu Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys
320 325 330

TTG GCT TAT TCA GCT GCC TGT GGT CTG TGC CAC CTG CAC ACA GAA ATT 1356
Leu Ala Tyr Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu Ile
335 340 345

TAT GGC ACC CAA GGA AAG CCC GCA ATT GCT CAT CGA GAC CTA AAG AGC 1404
Tyr Gly Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser
350 355 360 365

AAA AAC ATC CTC ATC AAG AAA AAT GGG AGT TGC TGC ATT GCT GAC CTG 1452
Lys Asn Ile Leu Ile Lys Lys Asn Gly Ser Cys Cys Ile Ala Asp Leu
370 375 380

GGC CTT GCT GTT AAA TTC AAC AGT GAC ACA AAT GAA GTT GAT GTG CCC 1500
Gly Leu Ala Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp Val Pro
385 390 395

TTG AAT ACC AGG GTG GGC ACC AAA CGC TAC ATG GCT CCC GAA GTG CTG 1548
Leu Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu
400 405 410

GAC GAA AGC CTG AAC AAA AAC CAC TTC CAG CCC TAC ATC ATG GCT GAC 1596
Asp Glu Ser Leu Asn Lys Asn His Phe Gln Pro Tyr Ile Met Ala Asp
415 420 425

ATC TAC AGC TTC GGC CTA ATC ATT TGG GAG ATG GCT CGT CGT TGT ATC 1644
Ile Tyr Ser Phe Gly Leu Ile Ile Trp Glu Met Ala Arg Arg Cys Ile
430 435 440 445

ACA GGA GGG ATC GTG GAA GAA TAC CAA TTG CCA TAT TAC AAC ATG GTA 1692
Thr Gly Gly Ile Val Glu Glu Tyr Gln Leu Pro Tyr Tyr Asn Met Val
450 455 460

CCG AGT GAT CCG TCA TAC GAA GAT ATG CGT GAG GTT GTG TGT GTC AAA 1740
Pro Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys
465 470 475

CGT TTG CGG CCA ATT GTG TCT AAT CGG TGG AAC AGT GAT GAA TGT CTA 1788
Arg Leu Arg Pro Ile Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu
480 485 490

CGA GCA GTT TTG AAG CTA ATG TCA GAA TGC TGG GCC CAC AAT CCA GCC 1836
Arg Ala Val Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala
495 500 505

TCC AGA CTC ACA GCA TTG AGA ATT AAG AAG ACG CTT GCC AAG ATG GTT 1884
Ser Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ala Lys Met Val
510 515 520 525

GAA TCC CAA GAT GTA AAA ATC TGATGGTTAA ACCATCGGAG GAGAAACTCT 1935
Glu Ser Gln Asp Val Lys Ile
530

AGACTGCAAG AACTGTTTTT ACCCATGGCA TGGGTGGAAT TAGAGTGGAA TAAGGATGTT 1995 

AACTTGGTTC TCAGACTCTT TCTTCACTAC GTGTTCACAG GCTGCTAATA TTAAACCTTT 2055 

CAGTACTCTT ATTAGGATAC AAGCTGGGAA CTTCTAAACA CTTCATTCTT TATATATGGA 2115 

CAGCTTTATT TTAAATGTGG TTTTTGATGC CTTTTTTTAA GTGGGTTTTT ATGAACTGCA 2175 

TCAAGACTTC AATCCTGATT AGTGTCTCCA GTCAAGCTCT GGGTACTGAA TTGCCTGTTC 2235 

ATAAAACGGT GCTTTCTGTG AAAGCCTTAA GAAGATAAAT GAGCGCAGCA GAGATGGAGA 2295 

AATAGACTTT GCCTTTTACC TGAGACATTC AGTTCGTTTG TATTCTACCT TTGTAAAACA 2355 

GCCTATAGAT GATGATGTGT TTGGGATACT GCTTATTTTA TGATAGTTTG TCCTGTGTCC 2415 

TTAGTGATGT GTGTGTGTCT CCATGCACAT GCACGCCGGG ATTCCTCTGC TGCCATTTGA 2475 

ATTAGAAGAA AATAATTTAT ATGCATGCAC AGGAAGATAT TGGTGGCCGG TGGTTTTGTG 2535 

CTTTAAAAAT GCAATATCTG ACCAAGATTC GCCAATCTCA TACAAGCCAT TTACTTTGCA 2595 

AGTGAGATAG CTTCCCCACC AGCTTTATTT TTTAACATGA AAGCTGATGC CAAGGCCAAA 2655 

AGAAGTTTAA AGCATCTGTA AATTTGGACT GTTTTCCTTC AACCACCATT TTTTTTGTGG 2715 

TTATTATTTT TGTCACGGAA AGCATCCTCT CCAAAGTTGG AGCTTCTATT GCCATGAACC 2775 

ATGCTTACAA AGAAAGCACT TCTTATTGAA GTGAATTCCT GCATTTGATA GCAATGTAAG 2835 

TGCCTATAAC CATGTTCTAT ATTCTTTATT CTCAGTAACT TTTAAAAGGG AAGTTATTTA 2895 

TATTTTGTGT ATAATGTGCT TTATTTGCAA ATCACCC 2932

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 532 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Thr Gln Leu Tyr Ile Tyr Ile Arg Leu Leu Gly Ala Tyr Leu Phe
1 5 10 15

Ile Ile Ser Arg Val Gln Gly Gln Asn Leu Asp Ser Met Leu His Gly
20 25 30

Thr Gly Met Lys Ser Asp Ser Asp Gln Lys Lys Ser Glu Asn Gly Val
35 40 45

Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser
50 55 60

Gly His Cys Pro Asp Asp Ala Ile Asn Asn Thr Cys Ile Thr Asn Gly
65 70 75 80

His Cys Phe Ala Ile Ile Glu Glu Asp Asp Gln Gly Glu Thr Thr Leu
85 90 95

Ala Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gln Cys Lys Asp
100 105 110

Ser Pro Lys Ala Gln Leu Arg Arg Thr Ile Glu Cys Cys Arg Thr Asn
115 120 125

Leu Cys Asn Gln Tyr Leu Gln Pro Thr Leu Pro Pro Val Val Ile Gly
130 135 140

Pro Phe Phe Asp Gly Ser Ile Arg Trp Leu Val Leu Leu Ile Ser Met
145 150 155 160

Ala Val Cys Ile Ile Ala Met Ile Ile Phe Ser Ser Cys Phe Cys Tyr
165 170 175

Lys His Tyr Cys Lys Ser Ile Ser Ser Arg Arg Arg Tyr Asn Arg Asp
180 185 190

Leu Glu Gln Asp Glu Ala Phe Ile Pro Val Gly Glu Ser Leu Lys Asp
195 200 205

Leu Ile Asp Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu
210 215 220

Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val Arg Gln Val
225 230 235 240

Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg Gly Glu 

245 250 255 

Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe 
260 265 270 

Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg His Glu Asn Ile
275 280 285

Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly Ser Trp Thr Gln
290 295 300

Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe
305 310 315 320

Leu Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr 

325 330 335 

Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu Ile Tyr Gly Thr
340 345 350

Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys Asn Ile
355 360 365

Leu Ile Lys Lys Asn Gly Ser Cys Cys Ile Ala Asp Leu Gly Leu Ala
370 375 380

Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp Val Pro Leu Asn Thr 
385 390 395 400 

Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Ser 

405 410 415 

Leu Asn Lys Asn His Phe Gln Pro Tyr Ile Met Ala Asp Ile Tyr Ser
420 425 430

Phe Gly Leu Ile Ile Trp Glu Met Ala Arg Arg Cys Ile Thr Gly Gly
435 440 445

Ile Val Glu Glu Tyr Gln Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp
450 455 460

Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys Arg Leu Arg 
465 470 475 480 

Pro Ile Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu Arg Ala Val
485 490 495

Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala Ser Arg Leu
500 505 510

Thr Ala Leu Arg Ile Lys Lys Thr Leu Ala Lys Met Val Glu Ser Gln
515 520 525

Asp Val Lys Ile
530



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1515 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG GCG GAG TCG GCC GGA GCC TCC TCC TTC TTC CCC CTT GTT GTC CTC 
Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu 
1 5 10 15

CTG CTC GCC GGC AGC GGC GGG TCC GGG CCC CGG GGG GTC CAG GCT CTG
Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Val Gln Ala Leu
20 25 30

CTG TGT GCG TGC ACC AGC TGC CTC CAG GCC AAC TAC ACG TGT GAG ACA
Leu Cys Ala Cys Thr Ser Cys Leu Gln Ala Asn Tyr Thr Cys Glu Thr
35 40 45

GAT GGG GCC TGC ATG GTT TCC TTT TTC AAT CTG GAT GGG ATG GAG CAC 
Asp Gly Ala Cys Met Val Ser Phe Phe Asn Leu Asp Gly Met Glu His 
50 55 60 

CAT GTG CGC ACC TGC ATC CCC AAA GTG GAG CTG GTC CCT GCC GGG AAG 
His Val Arg Thr Cys Ile Pro Lys Val Glu Leu Val Pro Ala Gly Lys
65 70 75 80

CCC TTC TAC TGC CTG AGC TCG GAG GAC CTG CGC AAC ACC CAC TGC TGC 288
Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr His Cys Cys
85 90 95

TAC ACT GAC TAC TGC AAC AGG ATC GAC TTG AGG GTG CCC AGT GGT CAC 336
Tyr Thr Asp Tyr Cys Asn Arg Ile Asp Leu Arg Val Pro Ser Gly His
100 105 110

CTC AAG GAG CCT GAG CAC CCG TCC ATG TGG GGC CCG GTG GAG CTG GTA 384 
Leu Lys Glu Pro Glu His Pro Ser Met Trp Gly Pro Val Glu Leu Val 
115 120 125 

GGC ATC ATC GCC GGC CCG GTG TTC CTC CTG TTC CTC ATC ATC ATC ATT 432
Gly Ile Ile Ala Gly Pro Val Phe Leu Leu Phe Leu Ile Ile Ile Ile
130 135 140

GTT TTC CTT GTC ATT AAC TAT CAT CAG CGT GTC TAT CAC AAC CGC CAG 480
Val Phe Leu Val Ile Asn Tyr His Gln Arg Val Tyr His Asn Arg Gln
145 150 155 160

AGA CTG GAC ATG GAA GAT CCC TCA TGT GAG ATG TGT CTC TCC AAA GAC 528
Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu Ser Lys Asp
165 170 175

AAG ACG CTC CAG GAT CTT GTC TAC GAT CTC TCC ACC TCA GGG TCT GGC 576
Lys Thr Leu Gln Asp Leu Val Tyr Asp Leu Ser Thr Ser Gly Ser Gly
180 185 190

TCA GGG TTA CCC CTC TTT GTC CAG CGC ACA GTG GCC CGA ACC ATC GTT 624
Ser Gly Leu Pro Leu Phe Val Gln Arg Thr Val Ala Arg Thr Ile Val
195 200 205

TTA CAA GAG ATT ATT GGC AAG GGT CGG TTT GGG GAA GTA TGG CGG GGC 672
Leu Gln Glu Ile Ile Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly
210 215 220

CGC TGG AGG GGT GGT GAT GTG GCT GTG AAA ATA TTC TCT TCT CGT GAA 720
Arg Trp Arg Gly Gly Asp Val Ala Val Lys Ile Phe Ser Ser Arg Glu
225 230 235 240

GAA CGG TCT TGG TTC AGG GAA GCA GAG ATA TAC CAG ACG GTC ATG CTG 768
Glu Arg Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr Val Met Leu
245 250 255

CGC CAT GAA AAC ATC CTT GGA TTT ATT GCT GCT GAC AAT AAA GAT AAT 816
Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn Lys Asp Asn
260 265 270

GGC ACC TGG ACA CAG CTG TGG CTT GTT TCT GAC TAT CAT GAG CAC GGG 864
Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His Glu His Gly
275 280 285

TCC CTG TTT GAT TAT CTG AAC CGG TAC ACA GTG ACA ATT GAG GGG ATG 912
Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Ile Glu Gly Met
290 295 300

ATT AAG CTG GCC TTG TCT GCT GCT AGT GGG CTG GCA CAC CTG CAC ATG 960
Ile Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His Leu His Met
305 310 315 320

GAG ATC GTG GGC ACC CAA GGG AAG CCT GGA ATT GCT CAT CGA GAC TTA 1008
Glu Ile Val Gly Thr Gln Gly Lys Pro Gly Ile Ala His Arg Asp Leu
325 330 335

AAG TCA AAG AAC ATT CTG GTG AAG AAA AAT GGC ATG TGT GCC ATA GCA 1056
Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Met Cys Ala Ile Ala
340 345 350

GAC CTG GGC CTG GCT GTC CGT CAT GAT GCA GTC ACT GAC ACC ATT GAC 1104
Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp Thr Ile Asp
355 360 365

ATT GCC CCG AAT CAG AGG GTG GGG ACC AAA CGA TAC ATG GCC CCT GAA 1152
Ile Ala Pro Asn Gln Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu
370 375 380

GTA CTT GAT GAA ACC ATT AAT ATG AAA CAC TTT GAC TCC TTT AAA TGT 1200
Val Leu Asp Glu Thr Ile Asn Met Lys His Phe Asp Ser Phe Lys Cys
385 390 395 400

GCT GAT ATT TAT GCC CTC GGG CTT GTA TAT TGG GAG ATT GCT CGA AGA 1248
Ala Asp Ile Tyr Ala Leu Gly Leu Val Tyr Trp Glu Ile Ala Arg Arg
405 410 415

TGC AAT TCT GGA GGA GTC CAT GAA GAA TAT CAG CTG CCA TAT TAC GAC 1296
Cys Asn Ser Gly Gly Val His Glu Glu Tyr Gln Leu Pro Tyr Tyr Asp
420 425 430

TTA GTG CCC TCT GAC CCT TCC ATT GAG GAA ATG CGA AAG GTT GTA TGT 1344 
Leu Val Pro Ser Asp Pro Ser Ile Glu Glu Met Arg Lys Val Val Cys
435 440 445

GAT CAG AAG CTG CGT CCC AAC ATC CCC AAC TGG TGG CAG AGT TAT GAG 1392
Asp Gln Lys Leu Arg Pro Asn Ile Pro Asn Trp Trp Gln Ser Tyr Glu
450 455 460 

GCA CTG CGG GTG ATG GGG AAG ATG ATG CGA GAG TGT TGG TAT GCC AAC 1440 
Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp Tyr Ala Asn 
465 470 475 480 

GGC GCA GCC CGC CTG ACG GCC CTG CGC ATC AAG AAG ACC CTC TCC CAG 1488 
Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ser Gln
485 490 495

CTC AGC GTG CAG GAA GAC GTG AAG ATC TAACTGCTCC CTCTCTCCAC 1535
Leu Ser Val Gln Glu Asp Val Lys Ile
500 505

ACGGAGCTCC TGGCAGCGAG AACTACGCAC AGCTGCCGCG TTGAGCGTAC GATGGAGGCC 1595 

TACCTCTCGT TTCTGCCCAG CCCTCTGTGG CCAGGAGCCC TGGCCCGCAA GAGGGACAGA 1655 

GCCCGGGAGA GACTCGCTCA CTCCCATGTT GGGTTTGAGA CAGACACCTT TTCTATTTAC 1715 

CTCCTAATGG CATGGAGACT CTGAGAGCGA ATTGTGTGGA GAACTCAGTG CCACACCTCG 1775 

AACTGGTTGT AGTGGGAAGT CCCGCGAAAC CCGGTGCATC TGGCACGTGG CCAGGAGCCA 1835 

TGACAGGGGC GCTTGGGAGG GGCCGGAGGA ACCGAGGTGT TGCCAGTGCT AAGCTGCCCT 1895 

GAGGGTTTCC TTCGGGGACC AGCCCACAGC ACACCAAGGT GGCCCGGAAG AACCAGAAGT 1955 

GCAGCCCCTC TCACAGGCAG CTCTGAGCCG CGCTTTCCCC TCCTCCCTGG GATGGACGCT 2015

GCCGGGAGAC TGCCAGTGGA GACGGAATCT GCCGCTTTGT CTGTCCAGCC GTGTGTGCAT 2075

GTGCCGAGGT GCGTCCCCCG TTGTGCCTGG TTCGTGCCAT GCCCTTACAC GTGCGTGTGA 2135

GTGTGTGTGT GTGTCTGTAG GTGCGCACTT ACCTGCTTGA GCTTTCTGTG CATGTGCAGG 2195

TCGGGGGTGT GGTCGTCATG CTGTCCGTGC TTGCTGGTGC CTCTTTTCAG TAGTGAGCAG 2255

CATCTAGTTT CCCTGGTGCC CTTCCCTGGA GGTCTCTCCC TCCCCCAGAG CCCCTCATGC 2315

CACAGTGGTA CTCTGTGT 2333



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu 
1 5 10 15

Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Val Gln Ala Leu
20 25 30

Leu Cys Ala Cys Thr Ser Cys Leu Gln Ala Asn Tyr Thr Cys Glu Thr
35 40 45

Asp Gly Ala Cys Met Val Ser Phe Phe Asn Leu Asp Gly Met Glu His
50 55 60

His Val Arg Thr Cys Ile Pro Lys Val Glu Leu Val Pro Ala Gly Lys
65 70 75 80 

Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr His Cys Cys 

85 90 95 

Tyr Thr Asp Tyr Cys Asn Arg Ile Asp Leu Arg Val Pro Ser Gly His
100 105 110 

Leu Lys Glu Pro Glu His Pro Ser Met Trp Gly Pro Val Glu Leu Val 
115 120 125 

Gly Ile Ile Ala Gly Pro Val Phe Leu Leu Phe Leu Ile Ile Ile Ile
130 135 140

Val Phe Leu Val Ile Asn Tyr His Gln Arg Val Tyr His Asn Arg Gln
145 150 155 160

Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu Ser Lys Asp
165 170 175

Lys Thr Leu Gln Asp Leu Val Tyr Asp Leu Ser Thr Ser Gly Ser Gly
180 185 190

Ser Gly Leu Pro Leu Phe Val Gln Arg Thr Val Ala Arg Thr Ile Val
195 200 205

Leu Gln Glu Ile Ile Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly
210 215 220

Arg Trp Arg Gly Gly Asp Val Ala Val Lys Ile Phe Ser Ser Arg Glu
225 230 235 240

Glu Arg Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr Val Met Leu
245 250 255

Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn Lys Asp Asn
260 265 270

Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His Glu His Gly
275 280 285

Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Ile Glu Gly Met
290 295 300

Ile Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His Leu His Met
305 310 315 320

Glu Ile Val Gly Thr Gln Gly Lys Pro Gly Ile Ala His Arg Asp Leu
325 330 335

Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Met Cys Ala Ile Ala
340 345 350

Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp Thr Ile Asp
355 360 365

Ile Ala Pro Asn Gln Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu
370 375 380

Val Leu Asp Glu Thr Ile Asn Met Lys His Phe Asp Ser Phe Lys Cys
385 390 395 400

Ala Asp Ile Tyr Ala Leu Gly Leu Val Tyr Trp Glu Ile Ala Arg Arg
405 410 415

Cys Asn Ser Gly Gly Val His Glu Glu Tyr Gln Leu Pro Tyr Tyr Asp
420 425 430

Leu Val Pro Ser Asp Pro Ser Ile Glu Glu Met Arg Lys Val Val Cys
435 440 445

Asp Gln Lys Leu Arg Pro Asn Ile Pro Asn Trp Trp Gln Ser Tyr Glu
450 455 460

Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp Tyr Ala Asn
465 470 475 480

Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ser Gln
485 490 495

Leu Ser Val Gln Glu Asp Val Lys Ile
500 505



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Mouse 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 77..1585

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GGCGAGGCGA GGTTTGCTGG GGTGAGGCAG CGGCGCGGCC GGGCCGGGCC GGGCCACAGG 60 

CGGTGGCGGC GGGACC ATG GAG GCG GCG GTC GCT GCT CCG CGT CCC CGG 109
Met Glu Ala Ala Val Ala Ala Pro Arg Pro Arg
1 5 10

CTG CTC CTC CTC GTG CTG GCG GCG GCG GCG GCG GCG GCG GCG GCG CTG 157
Leu Leu Leu Leu Val Leu Ala Ala Ala Ala Ala Ala Ala Ala Ala Leu
15 20 25

CTC CCG GGG GCG ACG GCG TTA CAG TGT TTC TGC CAC CTC TGT ACA AAA 205
Leu Pro Gly Ala Thr Ala Leu Gln Cys Phe Cys His Leu Cys Thr Lys
30 35 40

GAC AAT TTT ACT TGT GTG ACA GAT GGG CTC TGC TTT GTC TCT GTC ACA 253
Asp Asn Phe Thr Cys Val Thr Asp Gly Leu Cys Phe Val Ser Val Thr
45 50 55

GAG ACC ACA GAC AAA GTT ATA CAC AAC AGC ATG TGT ATA GCT GAA ATT 301
Glu Thr Thr Asp Lys Val Ile His Asn Ser Met Cys Ile Ala Glu Ile
60 65 70 75

GAC TTA ATT CCT CGA GAT AGG CCG TTT GTA TGT GCA CCC TCT TCA AAA 349
Asp Leu Ile Pro Arg Asp Arg Pro Phe Val Cys Ala Pro Ser Ser Lys
80 85 90

ACT GGG TCT GTG ACT ACA ACA TAT TGC TGC AAT CAG GAC CAT TGC AAT 397
Thr Gly Ser Val Thr Thr Thr Tyr Cys Cys Asn Gln Asp His Cys Asn
95 100 105

AAA ATA GAA CTT CCA ACT ACT GTA AAG TCA TCA CCT GGC CTT GGT CCT 445
Lys Ile Glu Leu Pro Thr Thr Val Lys Ser Ser Pro Gly Leu Gly Pro
110 115 120

GTG GAA CTG GCA GCT GTC ATT GCT GGA CCA GTG TGC TTC GTC TGC ATC
Val Glu Leu Ala Ala Val Ile Ala Gly Pro Val Cys Phe Val Cys Ile
125 130 135

TCA CTC ATG TTG ATG GTC TAT ATC TGC CAC AAC CGC ACT GTC ATT CAC
Ser Leu Met Leu Met Val Tyr Ile Cys His Asn Arg Thr Val Ile His
140 145 150 155

CAT CGA GTG CCA AAT GAA GAG GAC CCT TCA TTA GAT CGC CCT TTT ATT
His Arg Val Pro Asn Glu Glu Asp Pro Ser Leu Asp Arg Pro Phe Ile
160 165 170

TCA GAG GGT ACT ACG TTG AAA GAC TTA ATT TAT GAT ATG ACA ACG TCA
Ser Glu Gly Thr Thr Leu Lys Asp Leu Ile Tyr Asp Met Thr Thr Ser
175 180 185

GGT TCT GGC TCA GGT TTA CCA TTG CTT GTT CAG AGA ACA ATT GCG AGA
Gly Ser Gly Ser Gly Leu Pro Leu Leu Val Gln Arg Thr Ile Ala Arg
190 195 200

ACT ATT GTG TTA CAA GAA AGC ATT GGC AAA GGT CGA TTT GGA GAA GTT
Thr Ile Val Leu Gln Glu Ser Ile Gly Lys Gly Arg Phe Gly Glu Val
205 210 215

TGG AGA GGA AAG TGG CGG GGA GAA GAA GTT GCT GTT AAG ATA TTC TCC
Trp Arg Gly Lys Trp Arg Gly Glu Glu Val Ala Val Lys Ile Phe Ser
220 225 230 235

TCT AGA GAA GAA CGT TCG TGG TTC CGT GAG GCA GAG ATT TAT CAA ACT
Ser Arg Glu Glu Arg Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr
240 245 250

GTA ATG TTA CGT CAT GAA AAC ATC CTG GGA TTT ATA GCA GCA GAC AAT
Val Met Leu Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn
255 260 265

AAA GAC AAT GGT ACT TGG ACT CAG CTC TGG TTG GTG TCA GAT TAT CAT
Lys Asp Asn Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His
270 275 280

GAG CAT GGA TCC CTT TTT GAT TAC TTA AAC AGA TAC ACA GTT ACT GTG 
Glu His Gly Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Val 
285 290 295 

GAA GGA ATG ATA AAA CTT GCT CTG TCC ACG GCG AGC GGT CTT GCC CAT 
Glu Gly Met Ile Lys Leu Ala Leu Ser Thr Ala Ser Gly Leu Ala His
300 305 310 315

CTT CAC ATG GAG ATT GTT GGT ACC CAA GGA AAG CCA GCC ATT GCT CAT
Leu His Met Glu Ile Val Gly Thr Gln Gly Lys Pro Ala Ile Ala His
320 325 330

AGA GAT TTG AAA TCA AAG AAT ATC TTG GTA AAG AAG AAT GGA ACT TGC 1117
Arg Asp Leu Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Thr Cys
335 340 345

TGT ATT GCA GAC TTA GGA CTG GCA GTA AGA CAT GAT TCA GCC ACA GAT 1165
Cys Ile Ala Asp Leu Gly Leu Ala Val Arg His Asp Ser Ala Thr Asp
350 355 360

ACC ATT GAT ATT GCT CCA AAC CAC AGA GTG GGA ACA AAA AGG TAC ATG 1213
Thr Ile Asp Ile Ala Pro Asn His Arg Val Gly Thr Lys Arg Tyr Met
365 370 375

GCC CCT GAA GTT CTC GAT GAT TCC ATA AAT ATG AAA CAT TTT GAA TCC 1261 
Ala Pro Glu Val Leu Asp Asp Ser Ile Asn Met Lys His Phe Glu Ser
380 385 390 395

TTC AAA CGT GCT GAC ATC TAT GCA ATG GGC TTA GTA TTC TGG GAA ATT 1309
Phe Lys Arg Ala Asp Ile Tyr Ala Met Gly Leu Val Phe Trp Glu Ile
400 405 410

GCT CGA CGA TGT TCC ATT GGT GGA ATT CAT GAA GAT TAC CAA CTG CCT 1357
Ala Arg Arg Cys Ser Ile Gly Gly Ile His Glu Asp Tyr Gln Leu Pro
415 420 425

TAT TAT GAT CTT GTA CCT TCT GAC CCA TCA GTT GAA GAA ATG AGA AAA 1405
Tyr Tyr Asp Leu Val Pro Ser Asp Pro Ser Val Glu Glu Met Arg Lys
430 435 440

GTT GTT TGT GAA CAG AAG TTA AGG CCA AAT ATC CCA AAC AGA TGG CAG 1453
Val Val Cys Glu Gln Lys Leu Arg Pro Asn Ile Pro Asn Arg Trp Gln
445 450 455

AGC TGT GAA GCC TTG AGA GTA ATG GCT AAA ATT ATG AGA GAA TGT TGG 1501
Ser Cys Glu Ala Leu Arg Val Met Ala Lys Ile Met Arg Glu Cys Trp
460 465 470 475

TAT GCC AAT GGA GCA GCT AGG CTT ACA GCA TTG CGG ATT AAG AAA ACA 1549
Tyr Ala Asn Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr
480 485 490

TTA TCG CAA CTC AGT CAA CAG GAA GGC ATC AAA ATG TAATTCTACA 1595
Leu Ser Gln Leu Ser Gln Gln Glu Gly Ile Lys Met
495 500

GCTTTGCCTG AACTCTCCTT TTTTCTTCAG ATCTGCTCCT GGGTTTTAAT TTGGGAGGTC 1655 

AGTTGTTCTA CCTCACTGAG AGGGAACAGA AGGATATTGC TTCCTTTTGC AGCAGTGTAA 1715 

TAAAGTCAAT TAAAAACTTC CCAGGATTTC TTTGGACCCA GGAAACAGCC ATGTGGGTCC 1775 



TTTCTGTGCA CTATGAACGC TTCTTTCCCA GGACAGAAAA TGTGTAGTCT ACCTTTATTT 1835 



TTTATTAACA AAACTTGTTT TTTAAAAAGA TGATTGCTGG TCTTAACTTT AGGTAACTCT 1895

GCTGTGCTGG AGATCATCTT TAAGGGCAAA GGAGTTGGAT TGCTGAATTA CAATGAAACA 1955 

TGTCTTATTA CTAAAGAAAG TGATTTACTC CTGGTTAGTA CATTCTCAGA GGATTCTGAA 2015 

CCACTAGAGT TTCCTTGATT CAGACTTTGA ATGTACTGTT CTATAGTTTT TCAGGATCTT 2075 

AAAACTAACA CTTATAAAAC TCTTATCTTG AGTCTAAAAA TGACCTCATA TAGTAGTGAG 2135 

GAACATAATT CATGCAATTG TATTTTGTAT ACTATTATTG TTCTTTCACT TATTCAGAAC 2195 

ATTACATGCC TTCAAAATGG GATTGTACTA TACCAGTAAG TGCCACTTCT GTGTCTTTCT 2255 

AATGGAAATG AGTAGAATTG CTGAAAGTCT CTATGTTAAA ACCTATAGTG TTT 2308 



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 503 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Glu Ala Ala Val Ala Ala Pro Arg Pro Arg Leu Leu Leu Leu Val
1 5 10 15

Leu Ala Ala Ala Ala Ala Ala Ala Ala Ala Leu Leu Pro Gly Ala Thr
20 25 30

Ala Leu Gln Cys Phe Cys His Leu Cys Thr Lys Asp Asn Phe Thr Cys
35 40 45

Val Thr Asp Gly Leu Cys Phe Val Ser Val Thr Glu Thr Thr Asp Lys 
50 55 60 

Val Ile His Asn Ser Met Cys Ile Ala Glu Ile Asp Leu Ile Pro Arg
65 70 75 80

Asp Arg Pro Phe Val Cys Ala Pro Ser Ser Lys Thr Gly Ser Val Thr
85 90 95

Thr Thr Tyr Cys Cys Asn Gln Asp His Cys Asn Lys Ile Glu Leu Pro
100 105 110

Thr Thr Val Lys Ser Ser Pro Gly Leu Gly Pro Val Glu Leu Ala Ala 
115 120 125

Val Ile Ala Gly Pro Val Cys Phe Val Cys Ile Ser Leu Met Leu Met
130 135 140

Val Tyr Ile Cys His Asn Arg Thr Val Ile His His Arg Val Pro Asn
145 150 155 160

Glu Glu Asp Pro Ser Leu Asp Arg Pro Phe Ile Ser Glu Gly Thr Thr
165 170 175

Leu Lys Asp Leu Ile Tyr Asp Met Thr Thr Ser Gly Ser Gly Ser Gly
180 185 190

Leu Pro Leu Leu Val Gln Arg Thr Ile Ala Arg Thr Ile Val Leu Gln
195 200 205

Glu Ser Ile Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly Lys Trp
210 215 220

Arg Gly Glu Glu Val Ala Val Lys Ile Phe Ser Ser Arg Glu Glu Arg
225 230 235 240

Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr Val Met Leu Arg His
245 250 255

Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn Lys Asp Asn Gly Thr
260 265 270

Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His Glu His Gly Ser Leu
275 280 285

Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Val Glu Gly Met Ile Lys
290 295 300

Leu Ala Leu Ser Thr Ala Ser Gly Leu Ala His Leu His Met Glu Ile
305 310 315 320

Val Gly Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser
325 330 335

Lys Asn Ile Leu Val Lys Lys Asn Gly Thr Cys Cys Ile Ala Asp Leu
340 345 350

Gly Leu Ala Val Arg His Asp Ser Ala Thr Asp Thr Ile Asp Ile Ala
355 360 365

Pro Asn His Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu
370 375 380

Asp Asp Ser Ile Asn Met Lys His Phe Glu Ser Phe Lys Arg Ala Asp
385 390 395 400

Ile Tyr Ala Met Gly Leu Val Phe Trp Glu Ile Ala Arg Arg Cys Ser
405 410 415

Ile Gly Gly Ile His Glu Asp Tyr Gln Leu Pro Tyr Tyr Asp Leu Val
420 425 430

Pro Ser Asp Pro Ser Val Glu Glu Met Arg Lys Val Val Cys Glu Gln
435 440 445

Lys Leu Arg Pro Asn Ile Pro Asn Arg Trp Gln Ser Cys Glu Ala Leu
450 455 460

Arg Val Met Ala Lys Ile Met Arg Glu Cys Trp Tyr Ala Asn Gly Ala
465 470 475 480

Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ser Gln Leu Ser
485 490 495

Gln Gln Glu Gly Ile Lys Met
500



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1922 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Mouse 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 241..1746

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GAGAGCACAG CCCTTCCCAG TCCCCGGAGC CGCCGCGCCA CGCGCGCATG ATCAAGACCT 

TTTCCCCGGC CCCACAGGGC CTCTGGACGT GAGACCCCGG CCGCCTCCGC AAGGAGAGGC 

GGGGGTCGAG TCGCCCTGTC CAAAGGCCTC AATCTAAACA ATCTTGATTC CTGTTGCCGG 

CTGGCGGGAC CCTGAATGGC AGGAAATCTC ACCACATCTC TTCTCCTATC TCCAAGGACC 

ATG ACC TTG GGG AGC TTC AGA AGG GGC CTT TTG ATG CTG TCG GTG GCC 
Met Thr Leu Gly Ser Phe Arg Arg Gly Leu Leu Met Leu Ser Val Ala 
1 5 10 15

TTG GGC CTA ACC CAG GGG AGA CTT GCG AAG CCT TCC AAG CTG GTG AAC
Leu Gly Leu Thr Gln Gly Arg Leu Ala Lys Pro Ser Lys Leu Val Asn
20 25 30 

TGC ACT TGT GAG AGC CCA CAC TGC AAG AGA CCA TTC TGC CAG GGG TCA 
Cys Thr Cys Glu Ser Pro His Cys Lys Arg Pro Phe Cys Gln Gly Ser
35 40 45

TGG TGC ACA GTG GTG CTG GTT CGA GAG CAG GGC AGG CAC CCC CAG GTC
Trp Cys Thr Val Val Leu Val Arg Glu Gln Gly Arg His Pro Gln Val
50 55 60

TAT CGG GGC TGT GGG AGC CTG AAC CAG GAG CTC TGC TTG GGA CGT CCC
Tyr Arg Gly Cys Gly Ser Leu Asn Gln Glu Leu Cys Leu Gly Arg Pro
65 70 75 80 

ACG GAG TTT CTG AAC CAT CAC TGC TGC TAT AGA TCC TTC TGC AAC CAC 
Thr Glu Phe Leu Asn His His Cys Cys Tyr Arg Ser Phe Cys Asn His 

85 90 95 

AAC GTG TCT CTG ATG CTG GAG GCC ACC CAA ACT CCT TCG GAG GAG CCA
Asn Val Ser Leu Met Leu Glu Ala Thr Gln Thr Pro Ser Glu Glu Pro
100 105 110

GAA GTT GAT GCC CAT CTG CCT CTG ATC CTG GGT CCT GTG CTG GCC TTG
Glu Val Asp Ala His Leu Pro Leu Ile Leu Gly Pro Val Leu Ala Leu
115 120 125

CCG GTC CTG GTG GCC CTG GGT GCT CTG GGC TTG TGG CGT GTC CGG CGG 
Pro Val Leu Val Ala Leu Gly Ala Leu Gly Leu Trp Arg Val Arg Arg 
130 135 140 

AGG CAG GAG AAG CAG CGG GAT TTG CAC AGT GAC CTG GGC GAG TCC AGT 
Arg Gln Glu Lys Gln Arg Asp Leu His Ser Asp Leu Gly Glu Ser Ser
145 150 155 160

CTC ATC CTG AAG GCA TCT GAA CAG GCA GAC AGC ATG TTG GGG GAC TTC
Leu Ile Leu Lys Ala Ser Glu Gln Ala Asp Ser Met Leu Gly Asp Phe

165 170 175 

CTG GAC AGC GAC TGT ACC ACG GGC AGC GGC TCG GGG CTC CCC TTC TTG 
Leu Asp Ser Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu 
180 185 190 

GTG CAG AGG ACG GTA GCT CGG CAG GTT GCG CTG GTA GAG TGT GTG GGA 
Val Gln Arg Thr Val Ala Arg Gln Val Ala Leu Val Glu Cys Val Gly
195 200 205 

AAG GGC CGA TAT GGC GAG GTG TGG CGC GGT TCG TGG CAT GGC GAA AGC 
Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Ser Trp His Gly Glu Ser 
210 215 220

GTG GCG GTC AAG ATT TTC TCC TCA CGA GAT GAG CAG TCC TGG TTC CGG
Val Ala Val Lys Ile Phe Ser Ser Arg Asp Glu Gln Ser Trp Phe Arg
225 230 235 240

GAG ACG GAG ATC TAC AAC ACA GTT CTG CTT AGA CAC GAC AAC ATC CTA
Glu Thr Glu Ile Tyr Asn Thr Val Leu Leu Arg His Asp Asn Ile Leu
245 250 255

GGC TTC ATC GCC TCC GAC ATG ACT TCG CGG AAC TCG AGC ACG CAG CTG
Gly Phe Ile Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gln Leu
260 265 270

TGG CTC ATC ACC CAC TAC CAT GAA CAC GGC TCC CTC TAT GAC TTT CTG
Trp Leu Ile Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe Leu
275 280 285

CAG AGG CAG ACG CTG GAG CCC CAG TTG GCC CTG AGG CTA GCT GTG TCC
Gln Arg Gln Thr Leu Glu Pro Gln Leu Ala Leu Arg Leu Ala Val Ser
290 295 300

CCG GCC TGC GGC CTG GCG CAC CTA CAT GTG GAG ATC TTT GGC ACT CAA
Pro Ala Cys Gly Leu Ala His Leu His Val Glu Ile Phe Gly Thr Gln
305 310 315 320

GGC AAA CCA GCC ATT GCC CAT CGT GAC CTC AAG AGT CGC AAT GTG CTG
Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Arg Asn Val Leu
325 330 335

GTC AAG AGT AAC TTG CAG TGT TGC ATT GCA GAC CTG GGA CTG GCT GTG 
Val Lys Ser Asn Leu Gln Cys Cys Ile Ala Asp Leu Gly Leu Ala Val
340 345 350

ATG CAC TCA CAA AGC AAC GAG TAC CTG GAT ATC GGC AAC ACA CCC CGA
Met His Ser Gln Ser Asn Glu Tyr Leu Asp Ile Gly Asn Thr Pro Arg
355 360 365

GTG GGT ACC AAA AGA TAC ATG GCA CCC GAG GTG CTG GAT GAG CAC ATC
Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu His Ile
370 375 380 

CGC ACA GAC TGC TTT GAG TCG TAC AAG TGG ACA GAC ATC TGG GCC TTT 
Arg Thr Asp Cys Phe Glu Ser Tyr Lys Trp Thr Asp Ile Trp Ala Phe
385 390 395 400

GGC CTA GTG CTA TGG GAG ATC GCC CGG CGG ACC ATC ATC AAT GGC ATT
Gly Leu Val Leu Trp Glu Ile Ala Arg Arg Thr Ile Ile Asn Gly Ile

405 410 415 

GTG GAG GAT TAC AGG CCA CCT TTC TAT GAC ATG GTA CCC AAT GAC CCC 
Val Glu Asp Tyr Arg Pro Pro Phe Tyr Asp Met Val Pro Asn Asp Pro 
420 425 430

AGT TTT GAG GAC ATG AAA AAG GTG GTG TGC GTT GAC CAG CAG ACA CCC 1584
Ser Phe Glu Asp Met Lys Lys Val Val Cys Val Asp Gln Gln Thr Pro
435 440 445

ACC ATC CCT AAC CGG CTG GCT GCA GAT CCG GTC CTC TCC GGG CTG GCC 1632
Thr Ile Pro Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu Ala
450 455 460

CAG ATG ATG AGA GAG TGC TGG TAC CCC AAC CCC TCT GCT CGC CTC ACC 1680
Gln Met Met Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu Thr
465 470 475 480

GCA CTG CGC ATA AAG AAG ACA TTG CAG AAG CTC AGT CAC AAT CCA GAG 1728
Ala Leu Arg Ile Lys Lys Thr Leu Gln Lys Leu Ser His Asn Pro Glu
485 490 495

AAG CCC AAA GTG ATT CAC TAGCCCAGGG CCACCAGGCT TCCTCTGCCT 1776
Lys Pro Lys Val Ile His
500

AAAGTGTGTG CTGGGGAAGA AGACATAGCC TGTCTGGGTA GAGGGAGTGA AGAGAGTGTG 1836 

CACGCTGCCC TGTGTGTGCC TGCTCAGCTT GCTCCCAGCC CATCCAGCCA AAAATACAGC 1896

TGAGCTGAAA TTCAAAAAAA AAAAAA 1922



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Thr Leu Gly Ser Phe Arg Arg Gly Leu Leu Met Leu Ser Val Ala
1 5 10 15

Leu Gly Leu Thr Gln Gly Arg Leu Ala Lys Pro Ser Lys Leu Val Asn
20 25 30

Cys Thr Cys Glu Ser Pro His Cys Lys Arg Pro Phe Cys Gln Gly Ser
35 40 45

Trp Cys Thr Val Val Leu Val Arg Glu Gln Gly Arg His Pro Gln Val
50 55 60

Tyr Arg Gly Cys Gly Ser Leu Asn Gln Glu Leu Cys Leu Gly Arg Pro
65 70 75 80

Thr Glu Phe Leu Asn His His Cys Cys Tyr Arg Ser Phe Cys Asn His
85 90 95

Asn Val Ser Leu Met Leu Glu Ala Thr Gln Thr Pro Ser Glu Glu Pro
100 105 110

Glu Val Asp Ala His Leu Pro Leu Ile Leu Gly Pro Val Leu Ala Leu
115 120 125

Pro Val Leu Val Ala Leu Gly Ala Leu Gly Leu Trp Arg Val Arg Arg
130 135 140

Arg Gln Glu Lys Gln Arg Asp Leu His Ser Asp Leu Gly Glu Ser Ser
145 150 155 160

Leu Ile Leu Lys Ala Ser Glu Gln Ala Asp Ser Met Leu Gly Asp Phe
165 170 175

Leu Asp Ser Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu
180 185 190

Val Gln Arg Thr Val Ala Arg Gln Val Ala Leu Val Glu Cys Val Gly
195 200 205

Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Ser Trp His Gly Glu Ser
210 215 220

Val Ala Val Lys Ile Phe Ser Ser Arg Asp Glu Gln Ser Trp Phe Arg
225 230 235 240

Glu Thr Glu Ile Tyr Asn Thr Val Leu Leu Arg His Asp Asn Ile Leu
245 250 255

Gly Phe Ile Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gln Leu
260 265 270

Trp Leu Ile Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe Leu
275 280 285

Gln Arg Gln Thr Leu Glu Pro Gln Leu Ala Leu Arg Leu Ala Val Ser
290 295 300

Pro Ala Cys Gly Leu Ala His Leu His Val Glu Ile Phe Gly Thr Gln
305 310 315 320

Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Arg Asn Val Leu
325 330 335

Val Lys Ser Asn Leu Gln Cys Cys Ile Ala Asp Leu Gly Leu Ala Val
340 345 350

Met His Ser Gln Ser Asn Glu Tyr Leu Asp Ile Gly Asn Thr Pro Arg
355 360 365

Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu His Ile
370 375 380

Arg Thr Asp Cys Phe Glu Ser Tyr Lys Trp Thr Asp Ile Trp Ala Phe
385 390 395 400

Gly Leu Val Leu Trp Glu Ile Ala Arg Arg Thr Ile Ile Asn Gly Ile
405 410 415

Val Glu Asp Tyr Arg Pro Pro Phe Tyr Asp Met Val Pro Asn Asp Pro
420 425 430

Ser Phe Glu Asp Met Lys Lys Val Val Cys Val Asp Gln Gln Thr Pro
435 440 445

Thr Ile Pro Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu Ala
450 455 460

Gln Met Met Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu Thr
465 470 475 480

Ala Leu Arg Ile Lys Lys Thr Leu Gln Lys Leu Ser His Asn Pro Glu
485 490 495

Lys Pro Lys Val Ile His
500



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2070 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 
(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 217..1812

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ATTCATGAGA TGGAAGCATA GGTCAAAGCT GTTCGGAGAA ATTGGAACTA CAGTTTTATC

TAGCCACATC TCTGAGAATT CTGAAGAAAG CAGCAGGTGA AAGTCATTGC CAAGTGATTT

TGTTCTGTAA GGAAGCCTCC CTCATTCACT TACACCAGTG AGACAGCAGG ACCAGTCATT

CAAAGGGCCG TGTACAGGAC GCGTGGCAAT GAGACA ATG ACT CAG CTA TAC ACT
Met Thr Gln Leu Tyr Thr
1 5

TAC ATC AGA TTA CTG GGA GCC TGT CTG TTC ATC ATT TCT CAT GTT CAA
Tyr Ile Arg Leu Leu Gly Ala Cys Leu Phe Ile Ile Ser His Val Gln
10 15 20

GGG CAG AAT CTA GAT AGT ATG CTC CAT GGC ACT GGT ATG AAA TCA GAC
Gly Gln Asn Leu Asp Ser Met Leu His Gly Thr Gly Met Lys Ser Asp
25 30 35

TTG GAC CAG AAG AAG CCA GAA AAT GGA GTG ACT TTA GCA CCA GAG GAT
Leu Asp Gln Lys Lys Pro Glu Asn Gly Val Thr Leu Ala Pro Glu Asp
40 45 50

ACC TTG CCT TTC TTA AAG TGC TAT TGC TCA GGA CAC TGC CCA GAT GAT
Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser Gly His Cys Pro Asp Asp
55 60 65 70

GCT ATT AAT AAC ACA TGC ATA ACT AAT GGC CAT TGC TTT GCC ATT ATA
Ala Ile Asn Asn Thr Cys Ile Thr Asn Gly His Cys Phe Ala Ile Ile
75 80 85

GAA GAA GAT GAT CAG GGA GAA ACC ACA TTA ACT TCT GGG TGT ATG AAG
Glu Glu Asp Asp Gln Gly Glu Thr Thr Leu Thr Ser Gly Cys Met Lys
90 95 100

TAT GAA GGC TCT GAT TTT CAA TGC AAG GAT TCA CCG AAA GCC CAG CTA
Tyr Glu Gly Ser Asp Phe Gln Cys Lys Asp Ser Pro Lys Ala Gln Leu
105 110 115

CGC AGG ACA ATA GAA TGT TGT CGG ACC AAT TTG TGC AAC CAG TAT TTG
Arg Arg Thr Ile Glu Cys Cys Arg Thr Asn Leu Cys Asn Gln Tyr Leu
120 125 130

CAG CCT ACA CTG CCC CCT GTT GTT ATA GGT CCG TTC TTT GAT GGC AGC
Gln Pro Thr Leu Pro Pro Val Val Ile Gly Pro Phe Phe Asp Gly Ser
135 140 145 150

ATC CGA TGG CTG GTT GTG CTC ATT TCC ATG GCT GTC TGT ATA GTT GCT
Ile Arg Trp Leu Val Val Leu Ile Ser Met Ala Val Cys Ile Val Ala
155 160 165

ATG ATC ATC TTC TCC AGC TGC TTT TGC TAT AAG CAT TAT TGT AAG AGT
Met Ile Ile Phe Ser Ser Cys Phe Cys Tyr Lys His Tyr Cys Lys Ser
170 175 180






ATC TCA AGC AGG GGT CGT TAC AAC CGT GAT TTG GAA CAG GAT GAA GCA
Ile Ser Ser Arg Gly Arg Tyr Asn Arg Asp Leu Glu Gln Asp Glu Ala
185 190 195

TTT ATT CCA GTA GGA GAA TCA TTG AAA GAC CTG ATT GAC CAG TCC CAA
Phe Ile Pro Val Gly Glu Ser Leu Lys Asp Leu Ile Asp Gln Ser Gln
200 205 210

AGC TCT GGG AGT GGA TCT GGA TTG CCT TTA TTG GTT CAG CGA ACT ATT
Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu Leu Val Gln Arg Thr Ile
215 220 225 230

GCC AAA CAG ATT CAG ATG GTT CGG CAG GTT GGT AAA GGC CGC TAT GGA
Ala Lys Gln Ile Gln Met Val Arg Gln Val Gly Lys Gly Arg Tyr Gly
235 240 245

GAA GTA TGG ATG GGT AAA TGG CGT GGT GAA AAA GTG GCT GTC AAA GTG
Glu Val Trp Met Gly Lys Trp Arg Gly Glu Lys Val Ala Val Lys Val
250 255 260

TTT TTT ACC ACT GAA GAA GCT AGC TGG TTT AGA GAA ACA GAA ATC TAC
Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe Arg Glu Thr Glu Ile Tyr
265 270 275

CAG ACG GTG TTA ATG CGT CAT GAA AAT ATA CTT GGT TTT ATA GCT GCA
Gln Thr Val Leu Met Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala
280 285 290

GAC ATT AAA GGC ACT GGT TCC TGG ACT CAG CTG TAT TTG ATT ACT GAT
Asp Ile Lys Gly Thr Gly Ser Trp Thr Gln Leu Tyr Leu Ile Thr Asp
295 300 305 310

TAC CAT GAA AAT GGA TCT CTC TAT GAC TTC CTG AAA TGT GCC ACA CTA
Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe Leu Lys Cys Ala Thr Leu
315 320 325

GAC ACC AGA GCC CTA CTC AAG TTA GCT TAT TCT GCT GCT TGT GGT CTG
Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr Ser Ala Ala Cys Gly Leu
330 335 340

TGC CAC CTC CAC ACA GAA ATT TAT GGT ACC CAA GGG AAG CCT GCA ATT
Cys His Leu His Thr Glu Ile Tyr Gly Thr Gln Gly Lys Pro Ala Ile
345 350 355

GCT CAT CGA GAC CTG AAG AGC AAA AAC ATC CTT ATT AAG AAA AAT GGA
Ala His Arg Asp Leu Lys Ser Lys Asn Ile Leu Ile Lys Lys Asn Gly
360 365 370

AGT TGC TGT ATT GCT GAC CTG GGC CTA GCT GTT AAA TTC AAC AGT GAT
Ser Cys Cys Ile Ala Asp Leu Gly Leu Ala Val Lys Phe Asn Ser Asp
375 380 385 390





ACA AAT GAA GTT GAC ATA CCC TTG AAT ACC AGG GTG GGC ACC AAG CGG 1434
Thr Asn Glu Val Asp Ile Pro Leu Asn Thr Arg Val Gly Thr Lys Arg
395 400 405

TAC ATG GCT CCA GAA GTG CTG GAT GAA AGC CTG AAT AAA AAC CAT TTC 1482
Tyr Met Ala Pro Glu Val Leu Asp Glu Ser Leu Asn Lys Asn His Phe
410 415 420

CAG CCC TAC ATC ATG GCT GAC ATC TAT AGC TTT GGT TTG ATC ATT TGG 1530
Gln Pro Tyr Ile Met Ala Asp Ile Tyr Ser Phe Gly Leu Ile Ile Trp
425 430 435

GAA ATG GCT CGT CGT TGT ATT ACA GGA GGA ATC GTG GAG GAA TAT CAA 1578
Glu Met Ala Arg Arg Cys Ile Thr Gly Gly Ile Val Glu Glu Tyr Gln
440 445 450

TTA CCA TAT TAC AAC ATG GTG CCC AGT GAC CCA TCC TAT GAG GAC ATG 1626
Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp Pro Ser Tyr Glu Asp Met
455 460 465 470

CGT GAG GTT GTG TGT GTG AAA CGC TTG CGG CCA ATC GTG TCT AAC CGC 1674
Arg Glu Val Val Cys Val Lys Arg Leu Arg Pro Ile Val Ser Asn Arg
475 480 485

TGG AAC AGC GAT GAA TGT CTT CGA GCA GTT TTG AAG CTA ATG TCA GAA 1722
Trp Asn Ser Asp Glu Cys Leu Arg Ala Val Leu Lys Leu Met Ser Glu
490 495 500

TGT TGG GCC CAT AAT CCA GCC TCC AGA CTC ACA GCT TTG AGA ATC AAG 1770
Cys Trp Ala His Asn Pro Ala Ser Arg Leu Thr Ala Leu Arg Ile Lys
505 510 515

AAG ACA CTT GCA AAA ATG GTT GAA TCC CAG GAT GTA AAG ATT 1812
Lys Thr Leu Ala Lys Met Val Glu Ser Gln Asp Val Lys Ile
520 525 530

TGACAATTAA ACAATTTTGA GGGAGAATTT AGACTGCAAG AACTTCTTCA CCCAAGGAAT 1872 

GGGTGGGATT AGCATGGAAT AGGATGTTGA CTTGGTTTCC AGACTCCTTC CTCTACATCT 1932 

TCACAGGCTG CTAACAGTAA ACCTTACCGT ACTCTACAGA ATACAAGATT GGAACTTGGA 1992 

ACTTCAAACA TGTCATTCTT TATATATGAC AGCTTTGTTT TAATGTGGGG TTTTTTTGTT 2052 



TGCTTTTTTT GTTTTGTT 2070



(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 532 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Thr Gln Leu Tyr Thr Tyr Ile Arg Leu Leu Gly Ala Cys Leu Phe
1 5 10 15

Ile Ile Ser His Val Gln Gly Gln Asn Leu Asp Ser Met Leu His Gly
20 25 30

Thr Gly Met Lys Ser Asp Leu Asp Gln Lys Lys Pro Glu Asn Gly Val
35 40 45

Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser
50 55 60

Gly His Cys Pro Asp Asp Ala Ile Asn Asn Thr Cys Ile Thr Asn Gly
65 70 75 80

His Cys Phe Ala Ile Ile Glu Glu Asp Asp Gln Gly Glu Thr Thr Leu
85 90 95

Thr Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gln Cys Lys Asp
100 105 110

Ser Pro Lys Ala Gln Leu Arg Arg Thr Ile Glu Cys Cys Arg Thr Asn
115 120 125

Leu Cys Asn Gln Tyr Leu Gln Pro Thr Leu Pro Pro Val Val Ile Gly
130 135 140

Pro Phe Phe Asp Gly Ser Ile Arg Trp Leu Val Val Leu Ile Ser Met
145 150 155 160

Ala Val Cys Ile Val Ala Met Ile Ile Phe Ser Ser Cys Phe Cys Tyr
165 170 175

Lys His Tyr Cys Lys Ser Ile Ser Ser Arg Gly Arg Tyr Asn Arg Asp
180 185 190

Leu Glu Gln Asp Glu Ala Phe Ile Pro Val Gly Glu Ser Leu Lys Asp
195 200 205

Leu Ile Asp Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu
210 215 220

Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val Arg Gln Val
225 230 235 240

Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg Gly Glu
245 250 255

Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe
260 265 270

Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg His Glu Asn Ile
275 280 285

Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly Ser Trp Thr Gln
290 295 300

Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe
305 310 315 320

Leu Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr
325 330 335

Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu Ile Tyr Gly Thr
340 345 350

Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys Asn Ile
355 360 365

Leu Ile Lys Lys Asn Gly Ser Cys Cys Ile Ala Asp Leu Gly Leu Ala
370 375 380

Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp Ile Pro Leu Asn Thr
385 390 395 400

Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Ser
405 410 415

Leu Asn Lys Asn His Phe Gln Pro Tyr Ile Met Ala Asp Ile Tyr Ser
420 425 430

Phe Gly Leu Ile Ile Trp Glu Met Ala Arg Arg Cys Ile Thr Gly Gly
435 440 445

Ile Val Glu Glu Tyr Gln Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp
450 455 460

Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys Arg Leu Arg
465 470 475 480

Pro Ile Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu Arg Ala Val
485 490 495

Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala Ser Arg Leu
500 505 510

Thr Ala Leu Arg Ile Lys Lys Thr Leu Ala Lys Met Val Glu Ser Gln
515 520 525

Asp Val Lys Ile
530
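
The codon rows of SEQ ID NO: 13 and the residues of SEQ ID NO: 14 can be cross-checked by translating the coding sequence. The following is a minimal sketch that is not part of the original listing; it assumes the standard genetic code applies, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence (spaces allowed) into one-letter residues.

    Translation stops at the first stop codon, which is not emitted.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First six codons of the CDS of SEQ ID NO: 13 (positions 217-234).
print(translate("ATG ACT CAG CTA TAC ACT"))
# Last four codons of the CDS, followed by the TGA that begins the 3' region.
print(translate("GAT GTA AAG ATT TGA"))
```

The two excerpts correspond to Met Thr Gln Leu Tyr Thr at the start and Asp Val Lys Ile at the end of SEQ ID NO: 14.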



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Mouse 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 10.. 1524 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



CGCGGTTAC ATG GCG GAG TCG GCC GGA GCC TCC TCC TTC TTC CCC CTT 48
Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu
1 5 10

GTT GTC CTC CTG CTC GCC GGC AGC GGC GGG TCC GGG CCC CGG GGG ATC 96
Val Val Leu Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Ile
15 20 25

CAG GCT CTG CTG TGT GCG TGC ACC AGC TGC CTA CAG ACC AAC TAC ACC 144
Gln Ala Leu Leu Cys Ala Cys Thr Ser Cys Leu Gln Thr Asn Tyr Thr
30 35 40 45

TGT GAG ACA GAT GGG GCT TGC ATG GTC TCC ATC TTT AAC CTG GAT GGC 192
Cys Glu Thr Asp Gly Ala Cys Met Val Ser Ile Phe Asn Leu Asp Gly
50 55 60

GTG GAG CAC CAT GTA CGT ACC TGC ATC CCC AAG GTG GAG CTG GTT CCT 240
Val Glu His His Val Arg Thr Cys Ile Pro Lys Val Glu Leu Val Pro
65 70 75

GCT GGA AAG CCC TTC TAC TGC CTG AGT TCA GAG GAT CTG CGC AAC ACA 288
Ala Gly Lys Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr
80 85 90

CAC TGC TGC TAT ATT GAC TTC TGC AAC AAG ATT GAC CTC AGG GTC CCC 336
His Cys Cys Tyr Ile Asp Phe Cys Asn Lys Ile Asp Leu Arg Val Pro
95 100 105

AGC GGA CAC CTC AAG GAG CCT GCG CAC CCC TCC ATG TGG GGC CCT GTG 384
Ser Gly His Leu Lys Glu Pro Ala His Pro Ser Met Trp Gly Pro Val
110 115 120 125

GAG CTG GTC GGC ATC ATC GCC GGC CCC GTC TTC CTC CTC TTC CTT ATC 432
Glu Leu Val Gly Ile Ile Ala Gly Pro Val Phe Leu Leu Phe Leu Ile
130 135 140

ATT ATC ATC GTC TTC CTG GTC ATC AAC TAT CAC CAG CGT GTC TAC CAT 480
Ile Ile Ile Val Phe Leu Val Ile Asn Tyr His Gln Arg Val Tyr His
145 150 155

AAC CGC CAG AGG TTG GAC ATG GAG GAC CCC TCT TGC GAG ATG TGT CTC 528
Asn Arg Gln Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu
160 165 170

TCC AAA GAC AAG ACG CTC CAG GAT CTC GTC TAC GAC CTC TCC ACG TCA 576
Ser Lys Asp Lys Thr Leu Gln Asp Leu Val Tyr Asp Leu Ser Thr Ser
175 180 185

GGG TCT GGC TCA GGG TTA CCC CTT TTT GTC CAG CGC ACA GTG GCC CGA 624
Gly Ser Gly Ser Gly Leu Pro Leu Phe Val Gln Arg Thr Val Ala Arg
190 195 200 205

ACC ATT GTT TTA CAA GAG ATT ATC GGC AAG GGC CGG TTC GGG GAA GTA 672
Thr Ile Val Leu Gln Glu Ile Ile Gly Lys Gly Arg Phe Gly Glu Val
210 215 220

TGG CGT GGT CGC TGG AGG GGT GGT GAC GTG GCT GTG AAA ATC TTC TCT 720
Trp Arg Gly Arg Trp Arg Gly Gly Asp Val Ala Val Lys Ile Phe Ser
225 230 235

TCT CGT GAA GAA CGG TCT TGG TTC CGT GAA GCA GAG ATC TAC CAG ACC 768
Ser Arg Glu Glu Arg Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr
240 245 250

GTC ATG CTG CGC CAT GAA AAC ATC CTT GGC TTT ATT GCT GCT GAC AAT 816
Val Met Leu Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn
255 260 265

AAA GAT AAT GGC ACC TGG ACC CAG CTG TGG CTT GTC TCT GAC TAT CAC 864
Lys Asp Asn Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His
270 275 280 285

GAG CAT GGC TCA CTG TTT GAT TAT CTG AAC CGC TAC ACA GTG ACC ATT
Glu His Gly Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Ile
290 295 300

GAG GGA ATG ATT AAG CTA GCC TTG TCT GCA GCC AGT GGT TTG GCA CAC
Glu Gly Met Ile Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His
305 310 315

CTG CAT ATG GAG ATT GTG GGC ACT CAA GGG AAG CCG GGA ATT GCT CAT
Leu His Met Glu Ile Val Gly Thr Gln Gly Lys Pro Gly Ile Ala His
320 325 330

CGA GAC TTG AAG TCA AAG AAC ATC CTG GTG AAA AAA AAT GGC ATG TGT
Arg Asp Leu Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Met Cys
335 340 345

GCC ATT GCA GAC CTG GGC CTG GCT GTC CGT CAT GAT GCG GTC ACT GAC
Ala Ile Ala Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp
350 355 360 365

ACC ATA GAC ATT GCT CCA AAT CAG AGG GTG GGG ACC AAA CGA TAC ATG
Thr Ile Asp Ile Ala Pro Asn Gln Arg Val Gly Thr Lys Arg Tyr Met
370 375 380

GCT CCT GAA GTC CTT GAC GAG ACA ATC AAC ATG AAG CAC TTT GAC TCC
Ala Pro Glu Val Leu Asp Glu Thr Ile Asn Met Lys His Phe Asp Ser
385 390 395

TTC AAA TGT GCC GAC ATC TAT GCC CTC GGG CTT GTC TAC TGG GAG ATT
Phe Lys Cys Ala Asp Ile Tyr Ala Leu Gly Leu Val Tyr Trp Glu Ile
400 405 410

GCA CGA AGA TGC AAT TCT GGA GGA GTC CAT GAA GAC TAT CAA CTG CCG
Ala Arg Arg Cys Asn Ser Gly Gly Val His Glu Asp Tyr Gln Leu Pro
415 420 425

TAT TAC GAC TTA GTG CCC TCC GAC CCT TCC ATT GAG GAG ATG CGA AAG
Tyr Tyr Asp Leu Val Pro Ser Asp Pro Ser Ile Glu Glu Met Arg Lys
430 435 440 445

GTT GTA TGT GAC CAG AAG CTA CGG CCC AAT GTC CCC AAC TGG TGG CAG
Val Val Cys Asp Gln Lys Leu Arg Pro Asn Val Pro Asn Trp Trp Gln
450 455 460

AGT TAT GAG GCC TTG CGA GTG ATG GGA AAG ATG ATG CGG GAG TGC TGG
Ser Tyr Glu Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp
465 470 475

TAC GCC AAT GGT GCT GCC CGT CTG ACA GCT CTG CGC ATC AAG AAG ACT
Tyr Ala Asn Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr
480 485 490

CTG TCC CAG CTA AGC GTG CAG GAA GAT GTG AAG ATT TAAGCTGTTC 1534
Leu Ser Gln Leu Ser Val Gln Glu Asp Val Lys Ile
495 500 505



CTCTGCCTAC ACAAAGAACC TGGGCAGTGA GGATGACTGC AGCCACCGTG CAAGCGTCGT 1594

GGAGGCCTAT CCTCTTGTTT CTGCCCGGCC CTCTGGCAGA GCCCTGGCCT GCAAGAGGGA 1654

CAGAGCCTGG GAGACGCGCG CACTCCCGTT GGGTTTGAGA CAGACACTTT TTATATTTAC 1714

CTCCTGATGG CATGGAGACC TGAGCAAATC ATGTAGTCAC TCAATGCCAC AACTCAAACT 1774

GCTTCAGTGG GAAGTACAGA GACCCAGTGC ATTGCGTGTG CAGGAGCGTG AGGTGCTGGG 1834

CTCGCCAGGA GCGGCCCCCA TACCTTGTGG TCCACTGGGC TGCAGGTTTT CCTCCAGGGA 1894

CCAGTCAACT GGCATCAAGA TATTGAGAGG AACCGGAAGT TTCTCCCTCC TTCCCGTAGC 1954

AGTCCTGAGC CACACCATCC TTCTCATGGA CATCCGGAGG ACTGCCCCTA GAGACACAAC 2014

CTGCTGCCTG TCTGTCCAGC CAAGTGCGCA TGTGCCGAGG TGTGTCCCAC ATTGTGCCTG 2074

GTCTGTGCCA CGCCCGTGTG TGTGTGTGTG TGTGTGAGTG AGTGTGTGTG TGTACACTTA 2134

ACCTGCTTGA GCTTCTGTGC ATGTGT 2160



(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 505 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu
1 5 10 15

Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Ile Gln Ala Leu
20 25 30

Leu Cys Ala Cys Thr Ser Cys Leu Gln Thr Asn Tyr Thr Cys Glu Thr
35 40 45

Asp Gly Ala Cys Met Val Ser Ile Phe Asn Leu Asp Gly Val Glu His
50 55 60

His Val Arg Thr Cys Ile Pro Lys Val Glu Leu Val Pro Ala Gly Lys
65 70 75 80

Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr His Cys Cys
85 90 95

Tyr Ile Asp Phe Cys Asn Lys Ile Asp Leu Arg Val Pro Ser Gly His
100 105 110

Leu Lys Glu Pro Ala His Pro Ser Met Trp Gly Pro Val Glu Leu Val
115 120 125

Gly Ile Ile Ala Gly Pro Val Phe Leu Leu Phe Leu Ile Ile Ile Ile
130 135 140

Val Phe Leu Val Ile Asn Tyr His Gln Arg Val Tyr His Asn Arg Gln
145 150 155 160

Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu Ser Lys Asp
165 170 175

Lys Thr Leu Gln Asp Leu Val Tyr Asp Leu Ser Thr Ser Gly Ser Gly
180 185 190

Ser Gly Leu Pro Leu Phe Val Gln Arg Thr Val Ala Arg Thr Ile Val
195 200 205

Leu Gln Glu Ile Ile Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly
210 215 220

Arg Trp Arg Gly Gly Asp Val Ala Val Lys Ile Phe Ser Ser Arg Glu
225 230 235 240

Glu Arg Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr Val Met Leu
245 250 255

Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn Lys Asp Asn
260 265 270

Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His Glu His Gly
275 280 285

Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Ile Glu Gly Met
290 295 300

Ile Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His Leu His Met
305 310 315 320

Glu Ile Val Gly Thr Gln Gly Lys Pro Gly Ile Ala His Arg Asp Leu
325 330 335

Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Met Cys Ala Ile Ala
340 345 350

Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp Thr Ile Asp
355 360 365

Ile Ala Pro Asn Gln Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu
370 375 380

Val Leu Asp Glu Thr Ile Asn Met Lys His Phe Asp Ser Phe Lys Cys
385 390 395 400

Ala Asp Ile Tyr Ala Leu Gly Leu Val Tyr Trp Glu Ile Ala Arg Arg
405 410 415

Cys Asn Ser Gly Gly Val His Glu Asp Tyr Gln Leu Pro Tyr Tyr Asp
420 425 430

Leu Val Pro Ser Asp Pro Ser Ile Glu Glu Met Arg Lys Val Val Cys
435 440 445

Asp Gln Lys Leu Arg Pro Asn Val Pro Asn Trp Trp Gln Ser Tyr Glu
450 455 460

Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp Tyr Ala Asn
465 470 475 480

Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ser Gln
485 490 495

Leu Ser Val Gln Glu Asp Val Lys Ile
500 505
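
The CDS LOCATION features of the cDNA listings can be checked against the stated lengths of the corresponding protein listings. The sketch below is an illustrative consistency check rather than part of the original listing; the helper name `cds_codon_count` is hypothetical, and the coordinates and lengths are copied from the headers of SEQ ID NOs: 13 to 18.

```python
def cds_codon_count(start: int, end: int) -> int:
    """Number of codons in a CDS given 1-based inclusive coordinates."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    return length // 3


# cDNA SEQ ID NO -> (CDS start, CDS end, stated length of the protein listing)
LISTINGS = {
    13: (217, 1812, 532),  # protein: SEQ ID NO: 14
    15: (10, 1524, 505),   # protein: SEQ ID NO: 16
    17: (187, 1692, 502),  # protein: SEQ ID NO: 18
}

for seq_id, (start, end, stated_length) in LISTINGS.items():
    codons = cds_codon_count(start, end)
    status = "consistent" if codons == stated_length else "MISMATCH"
    print(f"SEQ ID NO: {seq_id}: {codons} codons vs {stated_length} residues ({status})")
```

In each case the codon count equals the stated residue count, which indicates that the CDS coordinates in these listings end at the last sense codon and do not include the stop codon.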



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1952 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Mouse 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 187.. 1692 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



AAGCGGCGGC AGAAGTTGCC GGCGTGGTGC TCGTAGTGAG GGCGCGGAGG ACCCGGGACC
TGGGAAGCGG CGGCGGGTTA ACTTCGGCTG AATCACAACC ATTTGGCGCT GAGCTATGAC 120

AAGAGAGCAA ACAAAAAGTT AAAGGAGCAA CCCGGCCATA AGTGAAGAGA GAAGTTTATT 180

GATAAC ATG CTC TTA CGA AGC TCT GGA AAA TTA AAT GTG GGC ACC AAG 228
Met Leu Leu Arg Ser Ser Gly Lys Leu Asn Val Gly Thr Lys
1 5 10

AAG GAG GAT GGA GAG AGT ACA GCC CCC ACC CCT CGG CCC AAG ATC CTA 276
Lys Glu Asp Gly Glu Ser Thr Ala Pro Thr Pro Arg Pro Lys Ile Leu
15 20 25 30

CGT TGT AAA TGC CAC CAC CAC TGT CCG GAA GAC TCA GTC AAC AAT ATC 324
Arg Cys Lys Cys His His His Cys Pro Glu Asp Ser Val Asn Asn Ile
35 40 45

TGC AGC ACA GAT GGG TAC TGC TTC ACG ATG ATA GAA GAA GAT GAC TCT 372
Cys Ser Thr Asp Gly Tyr Cys Phe Thr Met Ile Glu Glu Asp Asp Ser
50 55 60

GGA ATG CCT GTT GTC ACC TCT GGA TGT CTA GGA CTA GAA GGG TCA GAT 420
Gly Met Pro Val Val Thr Ser Gly Cys Leu Gly Leu Glu Gly Ser Asp
65 70 75

TTT CAA TGT CGT GAC ACT CCC ATT CCT CAT CAA AGA AGA TCA ATT GAA 468
Phe Gln Cys Arg Asp Thr Pro Ile Pro His Gln Arg Arg Ser Ile Glu
80 85 90

TGC TGC ACA GAA AGG AAT GAG TGT AAT AAA GAC CTC CAC CCC ACT CTG 516
Cys Cys Thr Glu Arg Asn Glu Cys Asn Lys Asp Leu His Pro Thr Leu
95 100 105 110

CCT CCT CTC AAG GAC AGA GAT TTT GTT GAT GGG CCC ATA CAC CAC AAG 564
Pro Pro Leu Lys Asp Arg Asp Phe Val Asp Gly Pro Ile His His Lys
115 120 125

GCC TTG CTT ATC TCT GTG ACT GTC TGT AGT TTA CTC TTG GTC CTC ATT 612
Ala Leu Leu Ile Ser Val Thr Val Cys Ser Leu Leu Leu Val Leu Ile
130 135 140

ATT TTA TTC TGT TAC TTC AGG TAT AAA AGA CAA GAA GCC CGA CCT CGG 660
Ile Leu Phe Cys Tyr Phe Arg Tyr Lys Arg Gln Glu Ala Arg Pro Arg
145 150 155

TAC AGC ATT GGG CTG GAG CAG GAC GAG ACA TAC ATT CCT CCT GGA GAG 708
Tyr Ser Ile Gly Leu Glu Gln Asp Glu Thr Tyr Ile Pro Pro Gly Glu
160 165 170

TCC CTG AGA GAC TTG ATC GAG CAG TCT CAG AGC TCG GGA AGT GGA TCA 756
Ser Leu Arg Asp Leu Ile Glu Gln Ser Gln Ser Ser Gly Ser Gly Ser
175 180 185 190

GGC CTC CCT CTG CTG GTC CAA AGG ACA ATA GCT AAG CAA ATT CAG ATG 804
Gly Leu Pro Leu Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met
195 200 205

GTG AAG CAG ATT GGA AAA GGC CGC TAT GGC GAG GTG TGG ATG GGA AAG 852
Val Lys Gln Ile Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys
210 215 220

TGG CGT GGA GAA AAG GTG GCT GTG AAA GTG TTC TTC ACC ACG GAG GAA 900
Trp Arg Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu
225 230 235

GCC AGC TGG TTC CGA GAG ACT GAG ATA TAT CAG ACG GTC CTG ATG CGG 948
Ala Ser Trp Phe Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg
240 245 250

CAT GAG AAT ATT CTG GGG TTC ATT GCT GCA GAT ATC AAA GGG ACT GGG 996
His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly
255 260 265 270

TCC TGG ACT CAG TTG TAC CTC ATC ACA GAC TAT CAT GAA AAC GGC TCC 1044
Ser Trp Thr Gln Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser
275 280 285

CTT TAT GAC TAT CTG AAA TCC ACC ACC TTA GAC GCA AAG TCC ATG CTG 1092
Leu Tyr Asp Tyr Leu Lys Ser Thr Thr Leu Asp Ala Lys Ser Met Leu
290 295 300

AAG CTA GCC TAC TCC TCT GTC AGC GGC CTA TGC CAT TTA CAC ACG GAA 1140
Lys Leu Ala Tyr Ser Ser Val Ser Gly Leu Cys His Leu His Thr Glu
305 310 315

ATC TTT AGC ACT CAA GGC AAG CCA GCA ATC GCC CAT CGA GAC TTG AAA 1188
Ile Phe Ser Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys
320 325 330

AGT AAA AAC ATC CTG GTG AAG AAA AAT GGA ACT TGC TGC ATA GCA GAC 1236
Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Thr Cys Cys Ile Ala Asp
335 340 345 350

CTG GGC TTG GCT GTC AAG TTC ATT AGT GAC ACA AAT GAG GTT GAC ATC 1284
Leu Gly Leu Ala Val Lys Phe Ile Ser Asp Thr Asn Glu Val Asp Ile
355 360 365

CCA CCC AAC ACC CGG GTT GGC ACC AAG CGC TAT ATG CCT CCA GAA GTG 1332
Pro Pro Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Pro Pro Glu Val
370 375 380

CTG GAC GAG AGC TTG AAT AGA AAC CAT TTC CAG TCC TAC ATT ATG GCT 1380
Leu Asp Glu Ser Leu Asn Arg Asn His Phe Gln Ser Tyr Ile Met Ala
385 390 395






GAC ATG TAC AGC TTT GGA CTC ATC CTC TGG GAG ATT GCA AGG AGA TGT 1428
Asp Met Tyr Ser Phe Gly Leu Ile Leu Trp Glu Ile Ala Arg Arg Cys
400 405 410

GTT TCT GGA GGT ATA GTG GAA GAA TAC CAG CTT CCC TAT CAC GAC CTG 1476
Val Ser Gly Gly Ile Val Glu Glu Tyr Gln Leu Pro Tyr His Asp Leu
415 420 425 430

GTG CCC AGT GAC CCT TCT TAT GAG GAC ATG AGA GAA ATT GTG TGC ATG 1524
Val Pro Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu Ile Val Cys Met
435 440 445

AAG AAG TTA CGG CCT TCA TTC CCC AAT CGA TGG AGC AGT GAT GAG TGT 1572
Lys Lys Leu Arg Pro Ser Phe Pro Asn Arg Trp Ser Ser Asp Glu Cys
450 455 460

CTC AGG CAG ATG GGG AAG CTT ATG ACA GAG TGC TGG GCG CAG AAT CCT 1620
Leu Arg Gln Met Gly Lys Leu Met Thr Glu Cys Trp Ala Gln Asn Pro
465 470 475

GCC TCC AGG CTG ACG GCC CTG AGA GTT AAG AAA ACC CTT GCC AAA ATG 1668
Ala Ser Arg Leu Thr Ala Leu Arg Val Lys Lys Thr Leu Ala Lys Met
480 485 490

TCA GAG TCC CAG GAC ATT AAA CTC TGACGTCAGA TACTTGTGGA CAGAGCAAGA 1722
Ser Glu Ser Gln Asp Ile Lys Leu
495 500

ATTTCACAGA AGCATCGTTA GCCCAAGCCT TGAACGTTAG CCTACTGCCC AGTGAGTTCA 1782
GACTTTCCTG GAAGAGAGCA CGGTGGGCAG ACACAGAGGA ACCCAGAAAC ACGGATTCAT 1842
CATGGCTTTC TGAGGAGGAG AAACTGTTTG GGTAACTTGT TCAAGATATG ATGCATGTTG 1902
CTTTCTAAGA AAGCCCTGTA TTTTGAATTA CCATTTTTTT ATAAAAAAAA 1952



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Leu Leu Arg Ser Ser Gly Lys Leu Asn Val Gly Thr Lys Lys Glu
1 5 10 15

Asp Gly Glu Ser Thr Ala Pro Thr Pro Arg Pro Lys Ile Leu Arg Cys
20 25 30

Lys Cys His His His Cys Pro Glu Asp Ser Val Asn Asn Ile Cys Ser
35 40 45

Thr Asp Gly Tyr Cys Phe Thr Met Ile Glu Glu Asp Asp Ser Gly Met
50 55 60

Pro Val Val Thr Ser Gly Cys Leu Gly Leu Glu Gly Ser Asp Phe Gln
65 70 75 80

Cys Arg Asp Thr Pro Ile Pro His Gln Arg Arg Ser Ile Glu Cys Cys
85 90 95

Thr Glu Arg Asn Glu Cys Asn Lys Asp Leu His Pro Thr Leu Pro Pro
100 105 110

Leu Lys Asp Arg Asp Phe Val Asp Gly Pro Ile His His Lys Ala Leu
115 120 125

Leu Ile Ser Val Thr Val Cys Ser Leu Leu Leu Val Leu Ile Ile Leu
130 135 140

Phe Cys Tyr Phe Arg Tyr Lys Arg Gln Glu Ala Arg Pro Arg Tyr Ser
145 150 155 160

Ile Gly Leu Glu Gln Asp Glu Thr Tyr Ile Pro Pro Gly Glu Ser Leu
165 170 175

Arg Asp Leu Ile Glu Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly Leu
180 185 190

Pro Leu Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val Lys
195 200 205

Gln Ile Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg
210 215 220

Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser
225 230 235 240

Trp Phe Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg His Glu
245 250 255

Asn Ile Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly Ser Trp
260 265 270

Thr Gln Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr
275 280 285

Asp Tyr Leu Lys Ser Thr Thr Leu Asp Ala Lys Ser Met Leu Lys Leu
290 295 300



Ala Tyr Ser Ser Val Ser Gly Leu Cys His Leu His Thr Glu Ile Phe
305 310 315 320

Ser Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys
325 330 335

Asn Ile Leu Val Lys Lys Asn Gly Thr Cys Cys Ile Ala Asp Leu Gly
340 345 350

Leu Ala Val Lys Phe Ile Ser Asp Thr Asn Glu Val Asp Ile Pro Pro
355 360 365

Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Pro Pro Glu Val Leu Asp
370 375 380

Glu Ser Leu Asn Arg Asn His Phe Gln Ser Tyr Ile Met Ala Asp Met
385 390 395 400

Tyr Ser Phe Gly Leu Ile Leu Trp Glu Ile Ala Arg Arg Cys Val Ser
405 410 415

Gly Gly Ile Val Glu Glu Tyr Gln Leu Pro Tyr His Asp Leu Val Pro
420 425 430

Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu Ile Val Cys Met Lys Lys
435 440 445

Leu Arg Pro Ser Phe Pro Asn Arg Trp Ser Ser Asp Glu Cys Leu Arg
450 455 460

Gln Met Gly Lys Leu Met Thr Glu Cys Trp Ala Gln Asn Pro Ala Ser
465 470 475 480

Arg Leu Thr Ala Leu Arg Val Lys Lys Thr Leu Ala Lys Met Ser Glu
485 490 495

Ser Gln Asp Ile Lys Leu
500



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 



GCGGATCCTG TTGTGAAGGN AATATGTG 



(2) INFORMATION FOR SEQ ID NO : 20: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20

GCGATCCGTC GCAGTCAAAA TTTT



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 



GCGGATCCGC GATATATTAA AAGCAA



(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
CGGAATTCTG GTGCCATATA 



(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23

ATTCAAGGGC ACATCAACTT CATTTGTGTC ACTGTTG



(2) INFORMATION FOR SEQ ID NO: 24:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 



GCGGATCCAC CATGGCGGAG TCGGCC



(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AACACCGGGC CGGCGATGAT 
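
The oligonucleotide listings can be sanity-checked by comparing each sequence with its stated length and by scanning for common restriction sites. The sketch below is illustrative and not part of the original listing: the sequences and lengths are copied from SEQ ID NOs: 19, 21, 22, 24 and 25, the recognition sequences GGATCC (BamHI) and GAATTC (EcoRI) are standard, and the helper name `find_sites` is hypothetical. Whether these sites were deliberately engineered into the primers is an assumption that the listing itself does not state.

```python
# SEQ ID NO -> (sequence with spaces removed, stated length in base pairs)
PRIMERS = {
    19: ("GCGGATCCTGTTGTGAAGGNAATATGTG", 28),
    21: ("GCGGATCCGCGATATATTAAAAGCAA", 26),
    22: ("CGGAATTCTGGTGCCATATA", 20),
    24: ("GCGGATCCACCATGGCGGAGTCGGCC", 26),
    25: ("AACACCGGGCCGGCGATGAT", 20),
}

# Recognition sequences of two widely used restriction enzymes.
SITES = {"BamHI": "GGATCC", "EcoRI": "GAATTC"}


def find_sites(sequence: str) -> dict:
    """Return {enzyme: 0-based offset} for each site present in the sequence."""
    hits = {}
    for enzyme, site in SITES.items():
        offset = sequence.find(site)
        if offset != -1:
            hits[enzyme] = offset
    return hits


for seq_id, (sequence, stated_length) in PRIMERS.items():
    length_ok = len(sequence) == stated_length
    print(f"SEQ ID NO: {seq_id}: length ok = {length_ok}, sites = {find_sites(sequence)}")
```

SEQ ID NO: 20 is left out of the table because its text is damaged in this copy.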



(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26

Gly Xaa Gly Xaa Xaa Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 27:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 



Asp Phe Lys Ser Arg Asn
1 5



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28



Asp Leu Lys Ser Lys Asn
1 5



(2) INFORMATION FOR SEQ ID NO: 29:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29



Gly Thr Lys Arg Tyr Met
1 5
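
The short peptide listings (SEQ ID NOs: 26 to 29) can be located inside the longer protein listings by converting three-letter residue codes to one-letter codes and treating Xaa as a wildcard. The sketch below is illustrative and not part of the original listing; the names `to_one_letter` and `motif_pattern` are hypothetical, and the two fragments are excerpts of SEQ ID NO: 14 (residues 241 to 256 and 359 to 368).

```python
import re

THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",  # any residue
}


def to_one_letter(three_letter: str) -> str:
    """Convert a space-separated three-letter sequence to one-letter codes."""
    return "".join(THREE_TO_ONE[token] for token in three_letter.split())


def motif_pattern(three_letter_motif: str) -> re.Pattern:
    """Compile a motif into a regex in which Xaa matches any single residue."""
    return re.compile(to_one_letter(three_letter_motif).replace("X", "."))


# Excerpts of SEQ ID NO: 14.
fragment_a = to_one_letter(
    "Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg Gly Glu")
fragment_b = to_one_letter("Ala His Arg Asp Leu Lys Ser Lys Asn Ile")

# SEQ ID NO: 26 (Gly Xaa Gly Xaa Xaa Gly) and SEQ ID NO: 28.
match_26 = motif_pattern("Gly Xaa Gly Xaa Xaa Gly").search(fragment_a)
match_28 = motif_pattern("Asp Leu Lys Ser Lys Asn").search(fragment_b)
print(match_26.group(), match_26.start())
print(match_28.group(), match_28.start())
```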



